Polynucleotides associated with cardiac muscle function

ABSTRACT

The invention provides compositions and novel polynucleotides and their encoded proteins that serve as surrogate markers in that they co-express with genes known to be involved associated with disorders associated with cardiac muscle function. The invention also provides expression vectors, host cells, proteins encoded by the polynucleotides and antibodies which specifically bind the proteins. The invention also provides methods for the diagnosis, prognosis, evaluation of therapies and treatment of disorders associated with cardiac muscle function.

[0001] This application is a continuation of pending prior applicationU.S. Ser. No.09/880,192, filed on Jun. 12, 2001, entitledPOLYNUCLEOTIDES ASSOCIATED WITH CARDIAC MUSCLE FUNCTION, which in turnis a continuation-in-part of U.S. Ser. No 09/299,708, filed Apr. 26,1999, now abandoned, entitled CARDIAC MUSCLE-ASSOCIATED GENES, both ofwhich are hereby expressly incorporated by reference.

FIELD OF THE INVENTION

[0002] The invention relates to 48 polynucleotides associated withcardiac muscle function that were identified by their coexpression withknown cardiac muscle-associated genes. The invention also relates to theuse of these polynucleotides, their encoded proteins and antibodieswhich specifically bind the proteins in diagnosis, prognosis, treatment,and evaluation of therapies for disorders associated with cardiac musclefunction.

BACKGROUND OF THE INVENTION

[0003] Vertebrates have three classes of muscle: skeletal, smooth, andcardiac. Skeletal and cardiac muscles have a striped appearance in thelight microscope and are therefore called striated. Cardiac muscleresembles skeletal muscle in many respects, but it is specialized forthe continuous, involuntary, rhythmic contractions needed for pumpingblood. Smooth muscles lack striations and surround internal organs suchas the intestines, the uterus, and large blood vessels. Skeletal muscleis under the voluntary control of the nervous system. Cardiac muscle andsmooth muscle are under the involuntary control of the nervous system.Compared with striated muscles, smooth muscle cells contract and relaxslowly and can create and maintain tension for long periods of time.

[0004] Muscle tissue is composed of bundles of multinucleated musclecells (myofibers). Each muscle cell contains bundles of actin and myosinfilaments (myofibrils) which extend the length of the cell. Themyofibril is composed of a chain of sarcomeres. The sarcomere is thefunctional unit of contraction. Myosin filaments are sandwiched betweenalternating layers of actin filaments. Myosin filaments are composed ofheavy and light chain proteins. Actin filaments are capped by twoproteins, capZ and tropomodulin. In addition, the myosin-binding sitesof actin filaments are protected by the tropomyosin-troponin regulatorycomplex. Contraction of muscle is initiated by actionpotential-stimulated release from the sarcoplasmic reticulum of calciumions into the cell to levels greater than 10⁻⁶ M. Binding of calciumions to troponin causes tropomyosin to move towards the center of theactin filament. This movement exposes the myosin-binding sites of actin.Prior to contraction, the N-terminal domain of the myosin heavychain-light chain complex (myosin head) forms a cross-bridge with actinfilaments. Binding of ATP to the myosin head causes dissociation ofmyosin from actin. This is followed by a conformational change of themyosin head and hydrolysis of ATP. The myosin head then forms a newcross-bridge with actin filaments. Successive cycle of ATP-binding,dissociation from actin, conformational changes, ATP hydrolysis, andcrossbridge formation results in muscle contraction. Relaxation isinitiated when calcium ion levels in the cell fall below 10⁻⁶ M. At thatlevel, calcium ions dissociate from troponin, which then shields themyosin-binding sites of actin.

[0005] Gap junctions, very permeable parts of the cell membrane, connectindividual muscle cells with each other. Through these gap junctions,ions diffuse relatively freely and transmit action potentials to allmuscle cells.

[0006] Differentiation of muscle cells during embryogenesis and ontogenyis regulated by a number of nuclear transcription factors such asmyogenin, MyoD, MEF2A, and myf-5, and by cell cycle proteins such asp21, p57, and RB. Expression of the genes which encode some of thesemyogenic regulatory proteins has been correlated with certain type oftumor and other disorders (Wang et al. (1995) Am J Pathol 147:1799-1810;Miyagawa et al.(I998) Nat Genet 18:15-17; and Sedehizade et al.(I997)Muscle Nerve 20:186-194).

[0007] Contemporary techniques for diagnosis of cardiac muscleabnormalities rely mainly on observation of clinical symptoms,electrocardiograms, and serological analyses of metabolites and enzymes.Relatively mild symptoms in the earlier stages of heart disease may evenbe overlooked. In addition, the serological analyses of the limitednumber of hormones or peptides do not always differentiate among thosediseases or syndromes which have overlapping or near-normal ranges ofhormonal or marker protein levels. Thus, development of new techniques,such as microarrays and transcript imaging, will contribute to the earlyand accurate diagnosis or to a better understanding of molecularpathogenesis of cardiac disorders.

[0008] The present invention satisfies a need in the art by providingnew compositions that are useful for diagnosis, prognosis, treatment,and evaluation of therapies for disorders associated with cardiac musclefunction.

SUMMARY OF THE INVENTION

[0009] The invention provides a composition comprising a plurality ofpolynucleotides having the nucleic acid sequences of SEQ ID NOs: 1-48that are highly significantly co-expressed with known the cardiacmuscle-associated genes: atrial regulatory myosin, ventricular myosinalkali light chain, cardiac troponin, cardiac ventricular myosin,cardiodilatin, creatine kinase M, myoglobin, natriuretic peptideprecursor, sarcomeric mitochondrial creatine kinase, telethonin, titin,and urocortin.

[0010] The invention also provides an isolated polynucleotide comprisinga nucleic acid sequence selected from SEQ ID NOs: 1-48 and thecomplements thereof. In different aspects, the polynucleotide is used asa surrogate marker, as a probe, in an expression vector, and in thediagnosis, prognosis, evaluation of therapies and treatment of disorderssuch as atherosclerosis, arteriosclerosis, atrial fibrillation, cancer(myxoma) and complications of cancer, cardiac injury, congestive heartfailure, coronary artery disease, hypertension, hypertrophiccardiomyopathy, myocardial hypertrophy, myocardial infarction, andplaque. The invention further provides a composition comprising apolynucleotide and a labeling moiety.

[0011] The invention provides a method for using a composition or apolynucleotide to screen a plurality of molecules and compounds toidentify or to purify ligands which specifically bind to the compositionor the polynucleotide. The molecules are selected from DNA molecules,RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes,transcription factors, enhancers, and repressors.

[0012] The invention provides a method for using a composition or apolynucleotide to detect gene expression in a sample by hybridizing thecomposition or polynucleotide to nucleic acids of the sample underconditions for formation of one or more hybridization complexes anddetecting hybridization complex formation, wherein complex formationindicates gene expression in the sample. In one aspect, the compositionor polynucleotide is attached to a substrate. In another aspect, thenucleic acids of the sample are amplified prior to hybridization. In yetanother aspect, complex formation is compared with at least one standardand indicates the presence of a disorder.

[0013] The invention provides a purified protein or a portion thereofselected from SEQ ID NOs:49-62, which is encoded by a polynucleotidethat is highly significantly co-expressed with genes known to involvedin disorders associated with cardiac muscle function. The invention alsoprovides a method for using a protein to screen a plurality of moleculesto identify or to purify at least one ligand which specifically bindsthe protein. The molecules are selected from aptamers, DNA molecules,RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes,proteins, antibodies, agonists, antagonists, immunoglobulins,inhibitors, pharmaceutical agents or drug compounds.

[0014] The invention provides a method of using a protein to make anantibody comprising immunizing a animal with the protein underconditions to elicit an antibody response, isolating animal antibodies,attaching the protein to a substrate, contacting the substrate withisolated antibodies under conditions to allow specific binding to theprotein, and dissociating the antibodies from the protein, therebyobtaining purified antibodies. The invention also provides a method forusing the antibody to detect expression of a protein in a sample, themethod comprising combining the antibody with a sample under conditionswhich allow the formation of antibody:protein complexes, and detectingcomplex formation, wherein complex formation indicates expression of theprotein in the sample. The invention also provides a compositioncomprising a polynucleotide, a protein, or an antibody that specificallybinds a protein and a labeling moiety or a pharmaceutical carrier.

BRIEF DESCRIPTION OF THE SEQUENCE LISTING AND TABLES

[0015] The Sequence Listing provides exemplary polynucleotide sequences,SEQ ID NOs: 1-48, and polypeptide sequences, SEQ ID NOs: 49-62. Eachsequence is identified by a sequence identification number (SEQ ID NO)and by the Incyte clone number with which the sequence was firstidentified.

[0016] Table 1 presents the results of co-expression analysis. Theentries in the table are the p-values which link the novelpolynucleotides with known marker genes.

[0017] Table 2 shows the characterization of proteins having the aminoacid sequences of SEQ ID NO: 49-62.

DESCRIPTION OF THE INVENTION

[0018] It must be noted that as used herein and in the appended claims,the singular forms “a”, “an”, and “the” include the plural referenceunless the context clearly dictates otherwise. Thus, for example, areference to “a host cell” includes a plurality of such host cells, anda reference to “an antibody” is a reference to one or more antibodiesand equivalents thereof known to those skilled in the art, and so forth.

[0019] Definitions

[0020] “Markers” refer to polynucleotides, proteins, and antibodieswhich are useful in the diagnosis, prognosis, evaluation of therapiesand treatment of disorders associated with cardiac muscle function.Typically, this means that the marker gene or polynucleotide isdifferentially expressed in samples from subjects predisposed to,manifesting, or diagnosed with disorders associated with cardiac musclefunction.

[0021] “Differential expression” refers to an increased or up-regulatedor a decreased or down-regulated expression as detected by presence,absence or at least about a two-fold change in the amount of transcribedmessenger RNA or protein in a sample.

[0022] “Disorders associated with cardiac muscle function” specificallyinclude, but are not limited to, the following conditions, diseases, anddisorders: atherosclerosis, arteriosclerosis, atrial fibrillation,cancer (myxoma) and complications of cancer, cardiac injury, congestiveheart failure, coronary artery disease, hypertension, hypertrophiccardiomyopathy, myocardial hypertrophy, myocardial infarction, andplaque.

[0023] “Isolated or purified” refers to a polynucleotide or protein thatis removed from its natural environment and that is separated from othercomponents with which it is naturally present.

[0024] “Genes known to be highly, and differentially, expressed incardiac muscle function” which were used in the co-expression analysisincluded atrial regulatory myosin, ventricular myosin alkali lightchain, cardiac troponin, cardiac ventricular myosin, cardiodilatin,creatine kinase M, myoglobin, natriuretic peptide precursor, sarcomericmitochondrial creatine kinase, telethonin, titin, and urocortin.

[0025] “Polynucleotide” refers to an isolated cDNA. It can be of genomicor synthetic origin, double-stranded or single-stranded, and combinedwith vitamins, minerals, carbohydrates, lipids, proteins, or othernucleic acids to perform a particular activity or form a usefulcomposition.

[0026] “Protein” refers to a purified polypeptide whether naturallyoccurring or synthetic.

[0027] “Sample” is used in its broadest sense. A sample containingnucleic acids can comprise a bodily fluid; an extract from a cell; achromosome, organelle, or membrane isolated from a cell; genomic DNA,RNA, or cDNA in solution or bound to a substrate; a cell; a tissue; atissue print; and the like.

[0028] “Substrate” refers to any rigid or semi-rigid support to whichpolynucleotides or proteins are bound and includes membranes, filters,chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels,capillaries or other tubing, plates, polymers, and microparticles with avariety of surface forms including wells, trenches, pins, channels andpores.

[0029] A “transcript image” is a profile of gene transcription activityin a particular tissue at a particular time.

[0030] A “variant” refers to a polynucleotide or protein whose sequencediverges from about 5% to about 30% from the nucleic acid or amino acidsequences of the Sequence Listing.

[0031] The Invention

[0032] The present invention employed “guilt by association (GBA)”, amethod for using marker genes known to be associated with cardiac musclefunction to identify surrogate markers, polynucleotides that aresimilarly associated or co-expressed in the same tissues, pathways ordisorders (Walker and Volkmuth (1999) Prediction of gene function bygenome-scale expression analysis: prostate-associated genes. Genome Res9:1198-1203, incorporated herein by reference). The genes known to beassociated with cardiac muscle function are atrial regulatory myosin,ventricular myosin alkali light chain, cardiac troponin, cardiacventricular myosin, cardiodilatin, creatine kinase M, myoglobin,natriuretic peptide precursor, sarcomeric mitochondrial creatine kinase,telethonin, titin, and urocortin. In particular, the method identifiescDNAs cloned from mRNA transcripts which were active in tissues removedfrom subjects with cardiac disorders including, but not limited to,atherosclerosis, arteriosclerosis, atrial fibrillation, cancer (myxoma)and complications of cancer, cardiac injury, congestive heart failure,coronary artery disease, hypertension, hypertrophic cardiomyopathy,myocardial hypertrophy, myocardial infarction, and plaque. Thepolynucleotides, their encoded proteins and antibodies whichspecifically bind to the encoded proteins are useful in the diagnosis,prognosis, evaluation of therapies, and treatment of disordersassociated with cardiac muscle function. U.S. Ser. No. 09/299,708 isincorporated in its entirety by reference herein.

[0033] Guilt by association provides for the identification ofpolynucleotides that are expressed in a plurality of libraries. Thepolynucleotides represent genes of unknown function which areco-expressed in a specific pathway, disease process, subcellularcompartment, cell type, tissue, or species. The expression patterns ofthe genes known to be highly and differentially expressed during cardiacmuscle function; atrial regulatory myosin, ventricular myosin alkalilight chain, cardiac troponin, cardiac ventricular myosin,cardiodilatin, creatine kinase M, myoglobin, natriuretic peptideprecursor, sarcomeric mitochondrial creatine kinase, telethonin, titin,and urocortin; are compared with those of polynucleotides with unknownfunction to determine whether a specified co-expression probabilitythreshold is met. Through this comparison, a subset of thepolynucleotides having a high co-expression probability with the knownmarker genes can be identified.

[0034] The polynucleotides originate from human cDNA libraries. Thesepolynucleotides can also be selected from a variety of sequence typesincluding, but not limited to, expressed sequence tags (ESTs), assembledpolynucleotides, full length coding regions, and 3′ untranslatedregions. To be considered in GBA or co-expression analysis, thepolynucleotides had to have been expressed in at least five cDNAlibraries. In this application, GBA was applied to a total of 45,233assembled polynucleotide bins that met the criteria of having beenexpressed in at least five libraries.

[0035] The cDNA libraries used in the co-expression analysis wereobtained from adrenal gland, biliary tract, bladder, blood cells, bloodvessels, bone marrow, brain, bronchus, cartilage, chromaffin system,colon, connective tissue, cultured cells, embryonic stem cells,endocrine glands, epithelium, esophagus, fetus, ganglia, heart,hypothalamus, hemic/immune system, intestine, islets of Langerhans,kidney, larynx, liver, lung, lymph, muscles, neurons, ovary, pancreas,penis, phagocytes, pituitary, placenta, pleura, prostate, salivaryglands, seminal vesicles, skeleton, spleen, stomach, testis, thymus,tongue, ureter, uterus, and the like. The number of cDNA librariesanalyzed can range from as few as three to greater than 10,000 andpreferably, the number of the cDNA libraries is greater than 500.

[0036] In a preferred embodiment, the polynucleotides are assembled fromrelated sequences, such as sequence fragments derived from a singletranscript. Assembly of the polynucleotide can be performed usingsequences of various types including, but not limited to, ESTs,extension of the EST, shotgun sequences from a cloned insert, or fulllength cDNAs. In a most preferred embodiment, the polynucleotides arederived from human sequences that have been assembled using thealgorithm disclosed in U.S. Ser. No. 9,276,534, filed Mar. 25, 1999, andused in U.S. Ser. No. 09/226,994, filed Jan. 7, 1999, both incorporatedherein by reference.

[0037] Experimentally, differential expression of the polynucleotidescan be evaluated by methods including, but not limited to, differentialdisplay by spatial immobilization or by gel electrophoresis, genomemismatch scanning, representational difference analysis, and transcriptimaging. For example, the results of transcript imaging for SEQ ID NOs:29 and 44 are shown in Example IX. Differential expression of SEQ ID NO:29 is highly specifically correlated with hypertension, and SEQ ID NO:44, with myocardial infarction. The transcript image provided directconfirmation of the strength of co-expression analysis—the use of knowngenes to identify unknown polynucleotides and their encoded proteinswhich are highly significantly associated with disorders associated withcardiac muscle function. Additionally, differential expression can beassessed by microarray technology. These methods can be used alone or incombination.

[0038] Genes known to be highly expressed in disorders associated withcardiac muscle function can be selected based on research in which thegenes are found to be key elements of biochemical or signaling pathwaysor on the known use of the genes as diagnostic or prognostic markers ortherapeutic targets for such disorders. Preferably, the known genes areatrial regulatory myosin, ventricular myosin alkali light chain, cardiactroponin, cardiac ventricular myosin, cardiodilatin, creatine kinase M,myoglobin, natriuretic peptide precursor, sarcomeric mitochondrialcreatine kinase, telethonin, titin, and urocortin.

[0039] The procedure for identifying novel polynucleotides that exhibita statistically significant co-expression pattern with known genes is asfollows. First, the presence or absence of a polynucleotide in a cDNAlibrary is defined: a polynucleotide is present in a cDNA library whenat least one cDNA fragment corresponding to the polynucleotide isdetected in a cDNA from that library, and a polynucleotide is absentfrom a library when no corresponding cDNA fragment is detected.

[0040] Second, the significance of co-expression is evaluated using aprobability method to measure a due-to-chance probability of theco-expression. The probability method can be the Fisher exact test, thechi-squared test, or the kappa test. These tests and examples of theirapplications are well known in the art and can be found in standardstatistics texts (Agresti (1990) Categorical Data Analysis, John Wiley &Sons, New York N.Y.; Rice (1988) Mathematical Statistics and DataAnalysis, Duxbury Press, Pacific Grove Calif.). A Bonferroni correction(Rice, supra, p. 384) can also be applied in combination with one of theprobability methods for correcting statistical results of onepolynucleotide versus multiple other polynucleotides. In a preferredembodiment, the due-to-chance probability is measured by a Fisher exacttest, and the threshold of the due-to-chance probability is setpreferably to less than 0.001, more preferably to less than 0.00001.

[0041] For example, to determine whether two genes, A and B, havesimilar co-expression patterns, occurrence data vectors can be generatedas illustrated in the table below. The presence of a gene occurring atleast once in a library is indicated by a one, and its absence from thelibrary, by a zero. Library 1 Library 2 Library 3 — Library N Gene A 1 10 — 0 Gene B 1 0 1 — 0

[0042] For a given pair of genes, the occurrence data in the table abovecan be summarized in a 2×2 contingency table. The second table (below)presents co-occurrence data for gene A and gene B in a total of 30libraries. Both gene A and gene B occur 10 times in the libraries. GeneA Present Gene A Absent Total Gene B Present  8  2 10 Gene B Absent  218 20 Total 10 20 30

[0043] The second table summarizes and presents: 1) the number of timesgene A and B are both present in a library; 2) the number of times geneA and B are both absent in a library; 3) the number of times gene A ispresent, and gene B is absent; and 4) the number of times gene B ispresent, and gene A is absent. The upper left entry is the number oftimes the two genes co-occur in a library, and the middle right entry isthe number of times neither gene occurs in a library. The off diagonalentries are the number of times one gene occurs, and the other does not.Both A and B are present eight times and absent 18 times. Gene A ispresent, and gene B is absent, two times; and gene B is present, andgene A is absent, two times. The probability (“p-value”) that the aboveassociation occurs due to chance as calculated using a Fisher exact testis 0.0003.

[0044] This method of estimating the probability for co-expression makesseveral assumptions. The method assumes that the libraries areindependent and are identically sampled. However, in practicalsituations, the selected cDNA libraries are not entirely independent,because more than one library can be obtained from a single subject ortissue. Nor are they entirely identically sampled, because differentnumbers of cDNAs can have been sequenced from each library. The numberof cDNAs sequenced typically ranges from 5,000 to 10,000 cDNAs perlibrary. After the Fisher exact co-expression probability is calculatedfor each polynucleotide versus all other assembled polynucleotides thatoccur, a Bonferroni correction for multiple statistical tests isapplied.

[0045] Using the method of the present invention, we have identifiedpolynucleotides, SEQ ID NOs: 1-48 and their encoded proteins, SEQ IDNOs: 49-62, that exhibit highly significant co-expression probabilitywith known marker genes for disorders associated with cardiac musclefunction. The results presented in Example VI show the directassociations among the novel polynucleotides and the known marker genesfor disorders associated with cardiac muscle function. Therefore, bythese associations, the novel polynucleotides are useful as surrogatemarkers for the co-expressed known markers in diagnosis, prognosis,evaluation of therapies and treatment of disorders associated withcardiac muscle function. Further, the proteins or peptides expressedfrom the novel polynucleotides are either potential therapeutics ortargets for the identification and/or development of therapeutics.

[0046] In one embodiment, the present invention encompasses acomposition comprising a plurality of polynucleotides having the nucleicacid sequences of SEQ ID NOs: 1-48 or the complements thereof. These 48polynucleotides are shown by the method to have significantco-expression with known markers for disorders associated with cardiacmuscle function. The invention also provides a polynucleotide, itscomplement, a probe comprising the polynucleotide or the complementthereof selected from SEQ ID NOs: 1-48.

[0047] The polynucleotide can be used to search against the GenBankprimate (pri), rodent (rod), mammalian (mam), vertebrate (vrtp), andeukaryote (eukp) databases; the encoded protein, against GenPept,SwissProt, BLOCKS (Bairoch et al. (1997) Nucleic Acids Res 25:217-221),PFAM, and other databases that contain previously identified andannotated protein sequences, motifs, and gene functions. Methods thatsearch for primary sequence patterns with secondary structure gappenalties (Smith et al. (1992) Protein Engineering 5:35-51) as well asalgorithms such as Basic Local Alignment Search Tool (BLAST; Altschul(1993) J Mol Evol 36:290-300; Altschul et al. (1990) J Mol Biol215:403-410), BLOCKS (Henikoff and Henikoff (1991) Nucleic Acids Res19:6565-6572), Hidden Markov Models (HMM; Eddy (1996) Cur Opin Str Biol6:361-365; Sonnhammer et al. (1997) Proteins 28:405-420), and the like,can be used to manipulate and analyze nucleotide and amino acidsequences. These databases, algorithms and other methods are well knownin the art and are described in Ausubel et al. (1997; Short Protocols inMolecular Biology, John Wiley & Sons, New York N.Y., unit 7.7) and inMeyers (1995; Molecular Biology and Biotechnology, Wiley VCH, New YorkN.Y., p 856-853).

[0048] Also encompassed by the invention are polynucleotides that arecapable of hybridizing to SEQ ID NOs: 1-48 and the complements thereofunder highly stringent conditions. Stringency can be defined by saltconcentration, temperature, and other chemicals and conditions wellknown in the art. Conditions can be selected, for example, by varyingthe concentrations of salt in the prehybridization, hybridization, andwash solutions or by varying the hybridization and wash temperatures.With some substrates, the temperature can be decreased by adding asolvent such as formamide to the prehybridization and hybridizationsolutions.

[0049] Hybridization can be performed at low stringency, with bufferssuch as 5×SSC (saline sodium citrate) with 1% sodium dodecyl sulfate(SDS) at 60C., which permits complex formation between two nucleic acidsequences that contain some mismatches. Subsequent washes are performedat higher stringency with buffers such as 0.2×SSC with 0.1 % SDS ateither 45° C. (medium stringency) or 68° C. (high stringency), tomaintain hybridization of only those complexes that contain completelycomplementary sequences. Background signals can be reduced by the use ofdetergents such as SDS, sarcosyl, or TRITON X-100 (Sigma-Aldrich, St.Louis Mo.), and/or a blocking agent, such as salmon sperm DNA.Hybridization methods are described in detail in Ausubel (supra, units2.8-2.11, 3.18-3.19 and 4-6-4.9) and Sambrook et al. (1989; MolecularCloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview N.Y.).

[0050] A polynucleotide can be extended utilizing primers and employingvarious PCR-based methods known in the art to detect upstream sequencessuch as promoters and other regulatory elements. (See, e.g., Dieffenbachand Dveksler (1995) PCR Primer, a Laboratory Manual, Cold Spring HarborPress, Plainview N.Y.) Commercially available kits such as XL-PCR(Applied Biosystems (ABI), Foster City Calif.), cDNA libraries (LifeTechnologies, Rockville Md.) or genomic libraries (Clontech, Palo AltoCalif.) and nested primers can be used to extend the sequence. For allPCR-based methods, primers can be designed using commercially availablesoftware (e.g., LASERGENE software, DNASTAR, Madison Wis. or anotherprogram), to be about 15 to 30 nucleotides in length, to have a GCcontent of about 50%, and to form a hybridization complex attemperatures of about 68° C. to 72° C.

[0051] In another aspect of the invention, the polynucleotide can becloned into a recombinant vector that directs the expression of theprotein, or structural or functional portions thereof, in host cells.Due to the inherent degeneracy of the genetic code, other DNA sequenceswhich encode functionally equivalent amino acid sequence can be producedand used to express the protein encoded by the polynucleotide. Thenucleotide sequences of the present invention can be engineered usingmethods generally known in the art in order to alter the nucleotidesequences for a variety of purposes including, but not limited to,modification of the cloning, processing, and/or expression of the geneproduct. DNA shuffling by random fragmentation, as described in U.S.Pat. No. 5,830,721, and PCR reassembly of gene fragments and syntheticoligonucleotides can be used to engineer the nucleotide sequences. Forexample, oligonucleotide-mediated site-directed mutagenesis can be usedto introduce mutations that create new restriction sites, alterglycosylation patterns, change codon preference, produce splicevariants, and so forth.

[0052] In order to express a biologically active protein, thepolynucleotide or derivatives thereof, can be inserted into anexpression vector with elements for transcriptional and translationalcontrol of the inserted coding sequence in a particular host. Theseelements include regulatory sequences, such as enhancers, constitutiveand inducible promoters, and 5′ and 3′ untranslated regions. Methodswhich are well known to those skilled in the art can be used toconstruct such expression vectors. These methods include in vitrorecombinant DNA techniques, synthetic techniques, and in vivo geneticrecombination (Ausubel, supra, unit 16).

[0053] A variety of expression vector/host cell systems can be utilizedto express the polynucleotide. These include, but are not limited to,microorganisms such as bacteria transformed with recombinantbacteriophage, plasmid, or cosimid expression vectors; yeast transformedwith yeast expression vectors; insect cell systems infected withbaculovirus vectors; plant cell systems transformed with viral orbacterial expression vectors; or animal cell systems. For long termproduction of recombinant proteins in mammalian systems, stableexpression in cell lines is preferred. For example, the polynucleotidecan be transformed into cell lines using expression vectors which cancontain viral origins of replication and/or endogenous expressionelements and a selectable or visible marker gene on the same or on aseparate vector. The invention is not to be limited by the vector orhost cell employed.

[0054] In general, host cells that contain the polynucleotide and thatexpress the protein can be identified by a variety of procedures knownto those of skill in the art. These procedures include, but are notlimited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, andprotein bioassay or immunoassay techniques which include membrane,solution, or chip-based technologies for the detection and/orquantification of nucleic acid or amino acid sequences. Immunologicalmethods for detecting and measuring the expression of the protein usingeither specific polyclonal or monoclonal antibodies are known in theart. Examples of such techniques include enzyme-linked immunosorbentassays (ELISAs), radioimmunoassays (RIAs), and fluorescence activatedcell sorting (FACS).

[0055] Host cells transformed with the polynucleotide can be culturedunder conditions for the expression and recovery of the protein fromcell culture. The protein produced by a transgenic cell can be secretedor retained intracellularly depending on the sequence and/or the vectorused. As will be understood by those of skill in the art, expressionvectors containing the polynucleotide can be designed to contain signalsequences which direct secretion of the protein through a prokaryoticcell wall or eukaryotic cell membrane.

[0056] In addition, a host cell strain can be chosen for its ability tomodulate expression of the inserted sequences or to process theexpressed protein in the desired fashion. Such modifications of theprotein include, but are not limited to, acetylation, carboxylation,glycosylation, phosphorylation, lipidation, and acylation.Post-translational processing which cleaves a “prepro” form of theprotein can also be used to specify protein targeting, folding, and/oractivity. Different host cells which have specific cellular machineryand characteristic mechanisms for post-translational activities (e.g.,CHO, HeLa, MDCK, HEK293, and W138) are available from the ATCC (ManassasVa.) and can be chosen to ensure the correct modification and processingof the expressed protein.

[0057] In another embodiment of the invention, natural, modified, orrecombinant polynucleotides are ligated to a heterologous sequenceresulting in translation of a fusion protein containing heterologousprotein moieties in any of the aforementioned host systems. Suchheterologous protein moieties facilitate purification of fusion proteinsusing commercially available affinity matrices. Such moieties include,but are not limited to, glutathione S-transferase, maltose bindingprotein, thioredoxin, calmodulin binding peptide, 6-His, FLAG, c-myc,hemaglutinin, and monoclonal antibody epitopes.

[0058] In another embodiment, the polynucleotides, wholly or in part,are synthesized using chemical or enzymatic methods well known in theart (Caruthers et al. (1980) Nucl Acids Symp Ser (7) 215-233; Ausubel,supra, units 10.4 and 10.16). Peptide synthesis can be performed usingvarious solid-phase techniques (Roberge et al. (1995) Science269:202-204), and machines such as the ABI 431A peptide synthesizer(ABI) can be used to automate synthesis. If desired, the amino acidsequence can be altered during synthesis to produce a more stablevariant for therapeutic use.

[0059] Screening, Diagnostics and Therapeutics

[0060] The polynucleotides can be used as surrogate markers indiagnosis, prognosis, evaluation of therapies and treatment of disordersassociated with cardiac muscle function including, but not limited to,atherosclerosis, arteriosclerosis, atrial fibrillation, cancer (myxoma)and complications of cancer, cardiac injury, congestive heart failure,coronary artery disease, hypertension, hypertrophic cardiomyopathy,myocardial hypertrophy, myocardial infarction, and plaque.

[0061] The polynucleotide can be used to screen a plurality or libraryof molecules and compounds for specific binding affinity. The assay canbe used to screen DNA molecules, RNA molecules, peptide nucleic acids,peptides, mimetics, ribozymes, or proteins including transcriptionfactors, enhancers, repressors, and the like which regulate the activityof the polynucleotide in the biological system. The assay involvesproviding a plurality of molecules and compounds, combining apolynucleotide or a composition of the invention with the plurality ofmolecules and compounds under conditions to allow specific binding, anddetecting specific binding to identify at least one molecule or compoundwhich specifically binds at least one polynucleotides of the invention.

[0062] Similarly the proteins, or portions thereof, can be used toscreen a plurality or library of molecules or compounds in any of avariety of screening assays to identify a ligand. The protein employedin such screening can be free in solution, affixed to an abioticsubstrate or expressed on the external, or a particular internalsurface, of a bacterial, or other, cell. Specific binding between theprotein and the ligand can be measured. The assay can be used to screenaptamers, DNA molecules, RNA molecules, peptide nucleic acids, peptides,mimetics, ribozymes, proteins, antibodies, agonists, antagonists,immunoglobulins, inhibitors, pharmaceutical agents or drug compounds andthe like, which specifically bind the protein. One method for highthroughput screening using very small assay volumes and very smallamounts of test compound is described in Burbaum et al. U.S. Pat. No.5,876,946, incorporated herein by reference, which screens large numbersof molecules for enzyme inhibition or receptor binding.

[0063] In one preferred embodiment, the polynucleotides are used fordiagnostic purposes to determine the differential expression of a genein a sample. The polynucleotide consists of complementary RNA and DNAmolecules, branched nucleic acids, and/or PNAs. In one alternative, thepolynucleotides are used to detect and quantify gene expression inbiopsied samples in which differential expression of the polynucleotideindicates the presence of a disorder. In another alternative, thepolynucleotide can be used to detect genetic polymorphisms associatedwith a disease or disorder. In a preferred embodiment, thesepolymorphisms are detected in an mRNA transcribed from an endogenousgene.

[0064] In another preferred embodiment, the polynucleotide is used as aprobe. Specificity of the probe is determined by whether it is made froma unique region, a regulatory region, or from a region encoding aconserved motif. Both probe specificity and the stringency of thediagnostic hybridization or amplification will determine whether theprobe identifies only naturally occurring, exactly complementarysequences, allelic variants, or related sequences. Probes designed todetect related sequences should preferably have at least 50% sequenceidentity to at least a fragment of a polynucleotide of the invention.

[0065] Methods for producing hybridization probes include the cloning ofnucleic acid sequences into vectors for the production of RNA probes.Such vectors are known in the art, are commercially available, and canbe used to synthesize RNA probes in vitro by adding RNA polymerases andlabeled nucleotides. Probes can incorporate nucleotides labeled by avariety of reporter groups including, but not limited to, radionuclidessuch as ³²P or ³⁵S, enzymatic labels such as alkaline phosphatasecoupled to the probe via avidin/biotin coupling systems, fluorescentlabels such as Cy3 and Cy5, and the like. The labeled polynucleotidescan be used in Southern or northern analysis, dot blot, or othermembrane-based technologies, on chips or other substrates, and in PCRtechnologies. Hybridization probes are also useful in mapping thenaturally occurring genomic sequence. Fluorescent in situ hybridization(FISH) can be correlated with other physical chromosome mappingtechniques and genetic map data as described in Heinz-Ulrich et al. (In:Meyers, supra, pp. 965-968). In many cases, genomic context helpsidentify genes that encode a particular protein family. (See, e.g.,Kirschning et al. (1997) Genomics 46:416-25.)

[0066] The polynucleotide can be labeled using standard methods andadded to a sample from a subject under conditions for the formation anddetection of hybridization complexes. After incubation the sample iswashed, and the signal associated with complex formation is quantitatedand compared with at least one standard value. Standard values arederived from any control sample, typically one that is free of thesuspect disorder and from one that represents a single, specific andpreferably, staged disorder. If the amount of signal in the subjectsample is distinguishable from the standards, then differentialexpression in the subject sample indicates the presence of the disorder.Qualitative and quantitative methods for comparing complex formation insubject samples with previously established standards are well known inthe art.

[0067] Such assays can also be used to evaluate the efficacy of aparticular therapeutic treatment regimen in animal studies, in clinicaltrials, or to monitor the treatment of an individual subject. Once thepresence of the disorder has been established and a treatment protocolis initiated, hybridization, amplification, or antibody assays can berepeated on a regular basis to determine when gene or protein expressionin the patient begins to approximate that which is observed in a healthysubject. The results obtained from successive assays can be used to showthe efficacy of treatment over a period ranging from several hours, e.g.in the case of toxic shock, to many years, e.g. in the case ofosteoarthritis.

[0068] The polynucleotides can be used on a substrate such as amicroarray to monitor gene expression, to identify splice variants,mutations, and polymorphisms. Information derived from analyses ofexpression patterns can be used to determine gene function, tounderstand the genetic basis of a disease, to diagnose a disorder, andto develop and monitor the activities of therapeutic agents used totreat a disorder. Microarrays can also be used to detect geneticdiversity, single nucleotide polymorphisms, which may characterize aparticular population, at the genomic level.

[0069] In another embodiment, antibodies or Fabs comprising an antigenbinding site that specifically binds the protein can be used for thediagnosis of diseases characterized by the differential expression ofthe protein. A variety of protocols for measuring protein expression,including ELISAs, RIAs, FACS and antibody arrays, are well known in theart and provide a basis for diagnosing differential or abnormal levelsof expression. Standard values for protein expression parallel thosereviewed above for nucleotide expression. The amount of complexformation can be quantitated by various methods, preferably byphotometric means. Quantities of the protein expressed in subjectsamples are compared with standard values. Deviation between standardand subject values establishes the parameters for diagnosing ormonitoring a particular disorder. Alternatively, one can use competitivedrug screening assays in which neutralizing antibodies capable ofbinding specifically with the protein compete with a test compound.Antibodies can be used to detect the presence of any peptide whichshares one or more epitopes or antigenic determinants with the protein.In one aspect, the antibodies of the present invention can be used fortreatment of a disorder, delivery of therapeutics, or monitoring therapyduring treatment.

[0070] In another aspect, the polynucleotide, or its complement, can beused therapeutically for the purpose of expressing mRNA and protein, orconversely to block transcription or translation of the mRNA. Expressionvectors can be constructed using elements from retroviruses,adenoviruses, herpes or vaccinia viruses, or bacterial plasmids, and thelike. These vectors can be used for delivery of nucleotide sequences toa particular target cell population, tissue, or organ. Methods wellknown to those skilled in the art can be used to construct vectors toexpress the polynucleotides or their complements. (See, e.g., Maulik etal. (1997) Molecular Biotechnology, Therapeutic Applications andStrategies, Wiley-Liss, New York N.Y.)

[0071] Alternatively, the polynucleotide or its complement, can be usedfor somatic cell or stem cell gene therapy. Vectors can be introduced invivo, in vitro, and ex vivo. For ex vivo therapy, vectors are introducedinto stem cells taken from the subject, and the resulting transgeniccells are clonally propagated for autologous transplant back into thatsame subject. Delivery of the polynucleotide by transfection, liposomeinjections, or polycationic amino polymers can be achieved using methodswhich are well known in the art. (See, e.g., Goldman et al. (1997)Nature Biotechniology 15:462-466.) Additionally, endogenous geneexpression can be inactivated using homologous recombination methodswhich insert an inactive gene sequence into the coding region or othertargeted region of the genome. (See, e.g. Thomas et al. (1987) Cell 51:503-512.)

[0072] Vectors containing the polynucleotide can be transformed into acell or tissue to express a missing protein or to replace anonfunctional protein. Similarly a vector constructed to express thecomplement of the polynucleotide can be transformed into a cell todown-regulate protein expression. Complementary or antisense sequencescan consist of an oligonucleotide derived from the transcriptioninitiation site; nucleotides between about positions −10 and +10 fromthe ATG are preferred. Similarly, inhibition can be achieved usingtriple helix base-pairing methodology. Triple helix pairing is usefulbecause it causes inhibition of the ability of the double helix to opensufficiently for the binding of polymerases, transcription factors, orregulatory molecules. Recent therapeutic advances using triplex DNA havebeen described in the literature. (See, e.g., Gee et al. In: Huber andCarr (1994) Molecular and Immunologic Approaches, Futura Publishing, Mt.Kisco N.Y., pp. 163-177.)

[0073] Ribozymes, enzymatic RNA molecules, can also be used to catalyzethe cleavage of mRNA and decrease the levels of particular mRNAs, suchas those comprising the polynucleotides of the invention. (See, e.g.,Rossi (1994) Current Biology 4: 469-471.) Ribozymes can cleave mRNA atspecific cleavage sites. Alternatively, ribozymes can cleave mRNAs atlocations dictated by flanking regions that form complementary basepairs with the target mRNA. The construction and production of ribozymesis well known in the art and is described in Meyers (supra).

[0074] RNA molecules can be modified to increase intracellular stabilityand half-life. Possible modifications include, but are not limited to,the addition of flanking sequences at the 5′ and/or 3′ ends of themolecule, or the use of phosphorothioate or 2′O-methyl rather thanphosphodiester linkages within the backbone of the molecule.Alternatively, nontraditional bases such as inosine, queosine, andwybutosine, as well as acetyl-, methyl-, thio-, and similarly modifiedforms of adenine, cytidine, guanine, thymine, and uridine which are notas easily recognized by endogenous endonucleases, can be included.

[0075] Further, an antagonist, or an antibody that binds specifically tothe protein can be administered to a subject to treat a disordersassociated with cardiac muscle function. The antagonist, antibody, orfragment can be used directly to inhibit the activity of the protein orindirectly to deliver a therapeutic agent to cells or tissues whichexpress the protein. The therapeutic agent can be a cytotoxic agentselected from a group including, but not limited to, abrin, ricin,doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin,etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxyanthracin dione, actinomycin D, diphteria toxin, Pseudomonas exotoxin Aand 40, radioisotopes, and glucocorticoid.

[0076] Antibodies to the protein can be generated using methods that arewell known in the art. One method involves immunizing a animal with theprotein selected from SEQ ID NOs: 49-62 under conditions to elicit anantibody response; isolating animal antibodies; attachinig the proteinto a substrate; contacting the substrate with isolated antibodies underconditions to allow specific binding to the protein; and dissociatingthe antibodies from the protein, thereby obtaining purified antibodies.Such antibodies can include, but are not limited to, polyclonal,monoclonal, chimeric, and single chain antibodies, Fab fragments, andfragments produced by a Fab expression library. Neutralizing antibodies,such as those which inhibit dimer formation, are especially preferredfor therapeutic use. Monoclonal antibodies to the protein can beprepared using any technique which provides for the production ofantibody molecules by continuous cell lines in culture. These include,but are not limited to, the hybridoma, the human B-cell hybridoma, andthe EBV-hybridoma techniques. In addition, techniques developed for theproduction of chimeric antibodies can be used. (See, e.g., Pound (1998)Immunochemical Protocols, Methods Mol Biol Vol. 80.) Alternatively,techniques described for the production of single chain antibodies canbe employed. Fabs which contain specific binding sites for the proteincan also be generated. Various immunoassays can be used to identifyantibodies having the desired specificity. Numerous protocols forcompetitive binding or immunoradiometric assays using either polyclonalor monoclonal antibodies with established specificities are well knownin the art.

[0077] Yet further, an agonist of the protein can be administered to asubject to treat a disorder associated with decreased expression,longevity or activity of the protein.

[0078] An additional aspect of the invention relates to theadministration of a pharmaceutical or sterile composition, inconjunction with a pharmaceutically acceptable carrier, for any of thetherapeutic applications discussed above. Such pharmaceuticalcompositions can consist of the protein or antibodies, mimetics,agonists, antagonists, or inhibitors of the protein. The compositionscan be administered alone or in combination with at least one otheragent, such as a stabilizing compound, which can be administered in anysterile, biocompatible pharmaceutical carrier including, but not limitedto, saline, buffered saline, dextrose, and water. The compositions canbe administered to a subject alone or in combination with other agents,drugs, or hormones.

[0079] The pharmaceutical compositions utilized in this invention can beadministered by any number of routes including, but not limited to,oral, intravenous, intramuscular, intra-arterial, intramedullary,intrathecal, intraventricular, transdermal, subcutaneous,intraperitoneal, intranasal, enteral, topical, sublingual, or rectalmeans.

[0080] In addition to the active ingredients, these pharmaceuticalcompositions can contain pharmaceutically-acceptable carriers comprisingexcipients and auxiliaries which facilitate processing of the activecompounds into preparations which can be used pharmaceutically. Furtherdetails on techniques for formulation and administration can be found inthe latest edition of Remington's Pharmaceutical Sciences (MackPublishing, Easton Pa.).

[0081] For any compound, the therapeutically effective dose can beestimated initially either in cell culture assays or in animal modelssuch as mice, rats, rabbits, dogs, or pigs. An animal model can also beused to determine the concentration range and route of administration.Such information can then be used to determine useful doses and routesfor administration in humans.

[0082] A therapeutically effective dose refers to that amount of activeingredient which ameliorates the symptoms or condition. Therapeuticefficacy and toxicity can be determined by standard pharmaceuticalprocedures in cell cultures or with experimental animals, such as bycalculating and contrasting the ED₅₀ (the dose therapeutically effectivein 50% of the population) and LD₅₀ (the dose lethal to 50% of thepopulation) statistics. Any of the therapeutic compositions describedabove can be applied to any subject in need of such therapy, including,but not limited to, mammals such as dogs, cats, cows, horses, rabbits,monkeys, and most preferably, humans.

[0083] Stem Cells and Their Use

[0084] SEQ ID NOs: 1-48 can be useful in the differentiation of stemcells. Eukaryotic stem cells are able to differentiate into the multiplecell types of various tissues and organs and to play roles inembryogenesis and adult tissue regeneration (Gearhart (1998) Science282:1061-1062; Watt and Hogan (2000) Science 287:1427-1430). Dependingon their source and developmental stage, stem cells can be totipotentwith the potential to create every cell type in an organism and togenerate a new organism, pluripotent with the potential to give rise tomost cell types and tissues, but not a whole organism; or multipotentcells with the potential to differentiate into a limited number of celltypes. Stem cells can be transfected with polynucleotides which can betransiently expressed or can be integrated within the cell astransgenes.

[0085] Embryonic stem (ES) cell lines are derived from the inner cellmasses of human blastocysts and are pluripotent (Thomson et al. (1998)Science 282:1145-1147). They have normal karyotypes and express highlevels of telomerase which prevent senescence and allow the cells toreplicate indefinitely. ES cells produce derivatives that give rise toembryonic epidermal, mesodermal and endodermal cells. Embryonic germ(EG) cell lines, which are produced from primordial germ cells isolatedfrom gonadal ridges and mesenteries, also show stem cell behavior(Shamblott et al. (1998) Proc Natl Acad Sci 95:13726-13731). EG cellshave normal karyotypes and appear to be pluripotent.

[0086] Organ-specific adult stem cells differentiate into the cell typesof the tissues from which they were isolated. They maintain theiroriginal tissues by replacing cells destroyed from disease or injury.Adult stem cells are multipotent and under proper stimulation can beused to generate cell types of various other tissues (Vogel (2000)Science 287:1418-1419). Hematopoietic stem cells from bone marrowprovide not only blood and immune cells, but can also be induced totransdifferentiate to form brain, liver, heart, skeletal muscle andsmooth muscle cells. Similarly mesenchymal stem cells can be used toproduce bone marrow, cartilage, muscle cells, and some neuron-likecells, and stem cells from muscle have the ability to differentiate intomuscle and blood cells (Jackson et al. (1999) Proc Natl Acad Sci96:14482-14486). Neural stem cells, which produce neurons and glia, canalso be induced to differentiate into heart, muscle, liver, intestine,and blood cells (Kuhn and Svendsen (1999) BioEssays 21:625-630); Clarkeet al. (2000) Science 288:1660-1663; Gage (2000) Science 287:1433-1438;and Galli et al. (2000) Nature Neurosci 3:986-991).

[0087] Neural stem cells can be used to treat neurological disorderssuch as Alzheimer's disease, Parkinson's disease, and multiple sclerosisand to repair tissue damaged by strokes and spinal cord injuries.Hematopoietic stem cells can be used to restore immune function inimmunodeficient patients or to treat autoimmune disorders by replacingautoreactive immune cells with normal cells to treat diseases such asmultiple sclerosis, scleroderma, rheumatoid arthritis, and systemiclupus erythematosus. Mesenchymal stem cells can be used to repairtendons or to regenerate cartilage to treat arthritis. Liver stem cellscan be used to repair liver damage. Pancreatic stem cells can be used toreplace islet cells to treat diabetes. Muscle stem cells can be used toregenerate muscle to treat muscular dystrophies (Fontes and Thomson(1999) BMJ 319:1-3; Weissman (2000) Science 287:1442-1446 Marshall(2000) Science 287:1419-1421; and Marmont (2000) Ann Rev Med51:115-134).

EXAMPLES

[0088] It is to be understood that this invention is not limited to theparticular devices, machines, materials and methods described. Althoughparticular embodiments are described, equivalent embodiments can be usedto practice the invention. The described embodiments are provided toillustrate the invention and are not intended to limit the scope of theinvention which is limited only by the appended claims.

[0089] I cDNA Library Construction

[0090] The cDNA library, LATRNOT01, was selected as an example todemonstrate library construction. The LATRNOT01 cDNA library wasconstructed from left atrial tissue obtained from a 51-year-oldCaucasian female who died of cerebral hemorrhage.

[0091] The frozen tissue was homogenized using a pestle and mortar andlysed using a POLYTRON homogenizer (Brinkmann Instruments, WestburyN.Y.) in guanidinium isothiocyanate solution. The lysate was centrifugedover a 5.7 M CsCl cushion using an SW28 swinging bucket rotor in anL8-70M ultracentrifuge (Beckman Coulter, Fullerton Calif.) for 18 hoursat 25,000 rpm and ambient temperature. The RNA was extracted twice withphenol, pH 8.0, precipitated using 0.3 M sodium acetate and 2.5 volumesof ethanol, resuspended in RNAse-free water, and treated with DNAse at37C. The mRNA was isolated using the OLIGOTEX kit (Qiagen, ChatsworthCalif.) and used to construct the cDNA library.

[0092] The mRNA was handled according to the recommended protocols inthe SUPERSCRIPT plasmid system (Life Technologies, Gaithersburg Md.).cDNAs were fractionated on a SEPHAROSE CL4B column (Amersham PharmaciaBiotech (APB), Piscataway N.J.), and those cDNAs exceeding 400 bp wereligated into the XhoI and EcoRI sites of the λ UNIZAP vector(Stratagene, La Jolla Calif.). The vector which contained thePBLUESCRIPT phagemid was subsequently transformed into XL1-BLUEMRF hostcells (Stratagene). The phagemid forms of individual cDNA clones wereobtained by the in vivo excision process, in which the host bacterialstrain was co-infected with both the λ library phage and an f1 helperphage. Enzymes derived from both the library-containing and helper phagenicked the λ DNA, initiated new DNA synthesis from defined sequences onthe λ target DNA, and created a smaller, single stranded circularphagemid DNA molecule that included all DNA sequences of the PBLUESCRIPTphagemid and the cDNA insert. The phagemid DNA was secreted from thecells, purified, and used to re-infect fresh host cells, where thedouble stranded phagemid DNA was produced.

[0093] II Isolation and Sequencing of cDNA Clones

[0094] Plasmid DNA was released from the bacterial cells and purifiedusing the REAL PREP 96 plasmid kit (Qiagen). This kit enabled thesimultaneous purification of 96 samples in a 96-well block usingmulti-channel reagent dispensers. The recommended protocol was employedexcept for the following changes: 1) the bacteria were cultured in 1 mlof sterile TERRIFIC BROTH (BD Biosciences, San Jose Calif.) withcarbenicillin at 25 mg/L and glycerol at 0.4%; 2) after inoculation, thecells were culture for 19 hours and then lysed in 0.3 ml of lysisbuffer; and 3) the plasmid DNA pellet was precipitated in isopropanoland then resuspended in 0.1 ml of distilled water. After the last stepin the protocol, samples were transferred to a 96-well block for storageat 4C.

[0095] The cDNAs were prepared using a MICROLAB 2200 system (Hamilton,Reno Nev.) in combination with DNA ENGINE thermal cyclers (MJ Research,Watertown Mass.). The cDNAs were sequenced by the method of Sanger andCoulson (1975; J Mol Biol 94:441-448) using ABI PRISM 373, 377 or 3700DNA sequencing systems (ABI). Most of the cDNAs were sequenced usingstandard ABI protocols and kits at solution volumes of 0.25×-1.0×. Inthe alternative, some of the cDNAs were sequenced using solutions anddyes from APB.

[0096] III Selection, Assembly, and Characterization of Sequences

[0097] The polynucleotides used for co-expression analysis wereassembled from EST sequences, 5′ and 3′ long read sequences, and fulllength coding sequences. The assembly process is described as follows.EST sequence chromatograms were processed and verified. Quality scoreswere obtained using PHRED (Ewing et al. (1998) Genome Res 8:175-185;Ewing and Green (1998) Genome Res 8:186-194), and edited sequences wereloaded into a relational database management system (RDBMS). Thesequences were clustered using BLAST with a product score of 50. Allclusters of two or more sequences created a bin which represents onetranscribed gene.

[0098] Assembly of the component sequences within each bin was performedusing a modification of Phrap, a publicly available program forassembling DNA fragments (Green, P. University of Washington, SeattleWash.). Bins that showed 82% identity from a local pair-wise alignmentbetween any of the consensus sequences were merged.

[0099] Bins were annotated by screening the consensus sequence in eachbin against public databases, such as GBpri and GenPept from NCBI. Theannotation process involved a FASTn screen against the GBpri database inGenBank. Those hits with a percent identity of greater than or equal to75% and an alignment length of greater than or equal to 100 base pairswere recorded as homolog hits. The residual unannotated sequences werescreened by FASTx against GenPept. Those hits with an E value of lessthan or equal to 10⁻⁸ were recorded as homolog hits.

[0100] Sequences were then reclustered using BLASTn and Cross-Match, aprogram for rapid amino acid and nucleic acid sequence comparison anddatabase search (Green, supra), sequentially. Any BLAST alignmentbetween a sequence and a consensus sequence with a score greater than150 was realigned using cross-match. The sequence was added to the binwhose consensus sequence gave the highest Smith-Waterman score (Smith etal. (1992) Protein Engineering 5:35-51) amongst local alignments with atleast 82% identity. Non-matching sequences were moved into new bins, andassembly processes were repeated.

[0101] IV Homology Searching of Polynucleotides and Their EncodedProteins

[0102] The polynucleotides of the Sequence Listing or their encodedproteins were used to query databases such as GenBank, SwissProt,BLOCKS, and the like. These databases that contain previously identifiedand annotated sequences or domains were searched using BLAST or BLAST 2(Altschul et al. supra; Altschul, supra) to produce alignments and todetermine which sequences were exact matches or homologs. The alignmentswere to sequences of prokaryotic (bacterial) or eukaryotic (animal,fungal, or plant) origin. Alternatively, algorithms such as the onedescribed in Smith and Smith (1992, Protein Engineering 5:35-51) couldhave been used to deal with primary sequence patterns and secondarystructure gap penalties. All of the sequences disclosed in thisapplication have lengths of at least 49 nucleotides, and no more than12% uncalled bases (where N is recorded rather than A, C, G, or T).

[0103] As detailed in Karlin and Altschul (1993; Proc Natl Acad Sci90:5873-5877), BLAST matches between a query sequence and a databasesequence were evaluated statistically and only reported when theysatisfied the threshold of 10⁻²⁵ for nucleotides and 10⁻¹⁴ for peptides.Homology was also evaluated by product score calculated as follows: the% nucleotide or amino acid identity [between the query and referencesequences] in BLAST is multiplied by the % maximum possible BLAST score[based on the lengths of query and reference sequences] and then dividedby 100. In comparison with hybridization procedures used in thelaboratory, the electronic stringency for an exact match was set at 70,and the conservative lower limit for an exact match was set atapproximately 40 (with 1-2% error due to uncalled bases).

[0104] The BLAST software suite, freely available sequence comparisonalgorithms (NCBI, Bethesda MD;http://www.ncbi.nlm.nih.gov/gorf/bl2.html), includes various sequenceanalysis programs including “blastn” that is used to align nucleic acidmolecules and BLAST 2 that is used for direct pairwise comparison ofeither nucleic or amino acid molecules. BLAST programs are commonly usedwith gap and other parameters set to default settings, e.g.: Matrix:BLOSUM62; Reward for match: 1; Penalty for mismatch: −2; Open Gap: 5 andExtension Gap: 2 penalties; Gap x drop-off: 50; Expect: 10; Word Size:11; and Filter: on. Identity or similarity is measured over the entirelength of a sequence or some smaller portion thereof. Brenner et al.(1998; Proc Natl Acad Sci 95:6073-6078, incorporated herein byreference) analyzed the BLAST for its ability to identify structuralhomologs by sequence identity and found 30% identity is a reliablethreshold for sequence alignments of at least 150 residues and 40%, foralignments of at least 70 residues.

[0105] The polynucleotides of this application were compared withassembled consensus sequences or templates found in the LIFESEQ GOLDdatabase. Component sequences from cDNA, extension, full length, andshotgun sequencing projects were subjected to PHRED analysis andassigned a quality score. All sequences with an acceptable quality scorewere subjected to various pre-processing and editing pathways to removelow quality 3′ ends, vector and linker sequences, polyA tails, Alurepeats, mitochondrial and ribosomal sequences, and bacterialcontamination sequences. Edited sequences had to be at least 50 bp inlength, and low-information sequences and repetitive elements such asdinucleotide repeats, Alu repeats, and the like, were replaced by “Ns”or masked.

[0106] Edited sequences were subjected to assembly procedures in whichthe sequences were assigned to polynucleotide bins. Each sequence couldonly belong to one bin, and sequences in each bin were assembled toproduce a template. Newly sequenced components were added to existingbins using BLAST and CROSSMATCH. To be added to a bin, the componentsequences had to have a BLAST quality score greater than or equal to 150and an alignment of at least 82% local identity. The sequences in eachbin were assembled using PHRAP. Bins with several overlapping componentsequences were assembled using DEEP PHRAP. The orientation of eachtemplate was determined based on the number and orientation of itscomponent sequences.

[0107] Bins were compared to one another and those having localsimilarity of at least 82% were combined and reassembled. Bins havingtemplates with less than 95% local identity were split. Templates weresubjected to analysis by STITCHER/EXON MAPPER algorithms that analyzethe probabilities of the presence of splice variants, alternativelyspliced exons, splice junctions, differential expression of alternativespliced genes across tissue types or disease states, and the like.Assembly procedures were repeated periodically, and templates wereannotated using BLAST against GenBank databases such as GBpri. An exactmatch was defined as having from 95% local identity over 200 base pairsthrough 100% local identity over 100 base pairs and a homolog match ashaving an E-value (or probability score) of ≦1×10⁻⁸. The templates werealso subjected to frameshift FASTx against GENPEPT, and homolog matchwas defined as having an E-value of ≦1×10⁻⁸. Template analysis andassembly was described in U.S. Ser. No. 09/276,534, filed Mar. 25, 1999.

[0108] Following assembly, templates were subjected to BLAST, motif, andother functional analyses and categorized in protein hierarchies usingmethods described in U.S. Ser. No. 08/812,290 and U.S. Ser. No.08/811,758, both filed Mar. 6, 1997; in U.S. Ser. No. 08/947,845, filedOct. 9, 1997; and in U.S. Ser. No. 09/034,807, filed Mar. 4, 1998. Thentemplates were analyzed by translating each template in all threeforward reading frames and searching each translation against the PFAMdatabase of hidden Markov model-based protein families and domains usingthe HMMER software package (Washington University School of Medicine,St. Louis Mo.; http://pfam.wustl.edu/).

[0109] The polynucleotide was further analyzed using MACDNASIS PROsoftware (Hitachi Software Engineering), and LASERGENE software(DNASTAR) and queried against public databases such as the GenBankrodent, mammalian, vertebrate, prokaryote, and eukaryote databases,SwissProt, BLOCKS, PRINTS, PFAM, and Prosite.

[0110] V Description of Known Cardiac Muscle-Associated Genes

[0111] Twelve known cardiac muscle-associated genes were selected toidentify novel polynucleotides that are closely associated with cardiacmuscle function. These known genes were atrial regulatory myosin,ventricular myosin alkali light chain, cardiac troponin, cardiacventricular myosin, cardiodilatin, creatine kinase M, myoglobin,natriuretic peptide precursor, sarcomeric mitochondrial creatine kinase,telethonin, titin, and urocortin.

[0112] Brief descriptions of the known cardiac muscle-associated genesand their expression in cardiac disorders are presented below. GENEDESCRIPTION AND REFERENCES atrial regulatory myosin Predominantregulatory myosin light chain isoform in adult atrial muscle.Differentially expressed in cardiovascular development and disease.Fewell et al. (1998) J Clin Invest 101: 2630-2639; Hailstones et al.(1992) J. Biol. Chem. 267: 23295-23300. ventricular myosin alkali Musclefiber protein. Differentially expressed in altered cardiovascularfunction and in light chain myocardial hypertrophy. Morano et al. (1997)J Mol Cell Cardiol 29: 1177-1187. troponin Marker of cardiac injury.Feng et al. (1998) Am J Clin Pathol 110: 70-77; Luscher et al. (1998)Cardiology 89: 222-228; and Kost et al. (1998) Arch Pathol Lab Med 122:245-251. cardiac ventricular Muscle fiber protein. Expressed in cardiacremodeling after myocardial infarction. myosin Differentially expressedin altered cardiovascular function. Trahair et al. (1993) J Mol CellCardiol 25: 577-585. cardiodilatin Differentially expressed followingmyocardial infarction. Induces vasorelaxation. Gidh- Jain et al. (1998)J Mol Cell Cardiol 30: 627-637; Magga et al. (1998) Ann Med 30(S1): 3945. creatine kinase M Marker of cardiac injury. Feng, supra; Luscher,supra; and Kost, supra. myoglobin Marker of cardiac injury. Feng, supra;Luscher, supra; and Kost, supra. natriuretic peptide See cardiodilatin.precursor sarcomeric Essential enzyme in energy metabolism, particularlyin tissue with high energy mitochondrial creatine requirements. Klein etal. (1991) J Biol Chem 266: 18058-18065; Qin et al. (1997) J Biol kinaseChem 272: 25210-25216. telethonin Sarcomeric protein of heart andskeletal muscle. Valle et al. (1997) FEBS Lett. 415: 163- 168; Mayans etal. (1998) Nature 395: 863-869. titin Muscle fiber protein. Temporal andspatial control of sarcomere assembly. Differentially expressed afteratrial fibrillation. Ausma et al. (1997) Am J Pathol 151: 985-997;Mayans, supra. urocortin Stimulates atrial natriuretic peptidesecretion. Expression increased following cardiac injury. Protectscardiac myocytes from hypoxic death. Ikeda et al. (1998) Biochem.Biophys Res Commun 250: 298-304; Asaba et al. (1998) Brain Res 806:95-103; and Okosi et al. (1998) Neuropeptides 32: 167-171.

[0113] VI Co-Expression Among Known Marker Genes and NovelPolynucleotides

[0114] GBA identified 48 novel polynucleotides from a total of 45,233assembled sequences that showed strong expression and association withthe known cardiac muscle-associated genes. The process was reiterateduntil the number of polynucleotides was reduced to the final 48polynucleotides shown below. Each of the 48 polynucleotides isco-expressed with at least one of the twelve known genes with a p-valueof less than 10⁻⁰⁵. The co-expression of the novel polynucleotides andthe known genes are shown in Table 1-1, 1-2, and 1-3. The novelpolynucleotides are listed along the top of the table by their SEQ IDNO, and the known genes, by their names in the rows down the side of thetable. The entries in the table are the negative log of the p-value(-log p) for the co-expression of two sequences. For eachpolynucleotide, the p-value is the probability that the observedco-expression is due to chance, using the Fisher Exact Test.

[0115] The highest co-expression value is obtained when the highestp-value found in a vertical column below the SEQ ID NO (clone number) iscorrelated with the name of a known marker gene listed for that row. Forexample, SEQ ID NO: 4, has a p-value of 19 as it co-expresses withcardiac ventricular myosin. This highly significant p-valuesubstantiates that SEQ ID NO: 4, SEQ ID NO: 49, and an antibody whichspecifically binds SEQ ID NO: 49 can be used as surrogate markers forcardiac ventricular myosin in a diagnostic assay for myocardialinfarction.

[0116] The data above can be summarized by reducing it to a singlehighest co-expression (-log p) value for each intersecting known geneand unknown polynucleotide and naming at least one disorder associatedwith expression of the known gene. A summary table is shown below: SEQID NO p-value Gene Disorder 1 7 atrial regulatory myosin cardiac injury2 6 natriuretic peptide precursor myocardial infarction 3 7 telethoninatrial fibrillation 4 19 cardiac ventricular myosin myocardialinfarction 5 9 creatine kinase M cardiac injury 6 11 titin atrialfibrillation 7 10 troponin cardiac injury 8 6 natriuretic peptideprecursor myocardial infarction 9 6 urocortin myocardial infarction 1012 telethonin atrial fibrillation 11 8 creatine kinase M cardiac injury12 9 atrial regulatory myosin cardiac injury 13 22 titin atrialfibrillation 14 8 ventricular myosin alkali light chain myocardialhypertrophy 15 10 titin atrial fibrillation 16 7 titin atrialfibrillation 17 8 telethonin atrial fibrillation 18 6 urocortinmyocardial infarction 19 11 creatine kinase M cardiac injury 20 13myoglobin cardiac injury 21 10 ventricular myosin alkali light chainmyocardial hypertrophy 22 10 troponin cardiac injury 23 11 titin atrialfibrillation 24 7 ventricular myosin alkali light chain myocardialhypertrophy 25 9 ventricular myosin alkali light chain myocardialhypertrophy 26 18 creatine kinase M cardiac injury 27 19 ventricularmyosin alkali light chain myocardial hypertrophy 28 21 creatine kinase Mcardiac injury 29 5 sarcomeric mitoch. creatine kinase hypertension 3015 myoglobin cardiac injury 31 7 telethonin atrial fibrillation 32 8creatine kinase M cardiac injury 33 11 titin atrial fibrillation 34 9atrial regulatory myosin cardiac injury 35 8 creatine kinase M cardiacinjury 36 7 cardiac ventricular myosin myocardial infarction 37 16myoglobin cardiac injury 38 11 myoglobin cardiac injury 39 21 creatinekinase M cardiac injury 40 11 creatine kinase M cardiac injury 41 20creatine kinase M cardiac injury 42 8 titin atrial fibrillation 43 6cardiac ventricular myosin myocardial infarction 44 7 cardiodilantinmyocardial infarction 45 10 telethonin atrial fibrillation 46 11creatine kinase M cardiac injury 47 9 atrial regulatory myosin cardiacinjury 48 9 telethonin atrial fibrillation

[0117] VII Description of the Polynucleotides Identified Using GBA

[0118] Using the method of Walker (supra), 48 polynucleotides thatexhibit strong association, or co-expression, with cardiacmuscle-associated genes have been identified.

[0119] Polynucleotides comprising the nucleic acid sequences of SEQ IDNOs: 1-48 of the present invention were first identified as IncyteClones 2045674, 188552, 465676, 3601719, 305781, 971441, 3445829,189299, 2396760, 919893, 2837330, 1737459, 058201, 767447, 5449893,2951269, 282977, 3178454, 3563859, 985730, 3684987, 986166, 1887508,1006416, 975169, 4152861, 986464,118472, 1314633, 1997439, 2638878,3795510, 1413537, 1623157, 3009303, 3434460, 5022769, 944140, 3445829,3016490, 4151935, 3719652, 3046106, 3012947, 466761, 1644171, 3009806,and 5578191, respectively; and assembled according to Example III. Asdescribed in Example IV, BLAST and other motif searches were performedfor each sequence. SEQ ID NOs: 1-48 were translated, and identity withknown sequences was sought. Proteins comprising SEQ ID NOs: 49-62 werealso analyzed using BLAST and other motif search tools as disclosed inExample VI. The details of the various analyses are described in Table2.

[0120] VIII Hybridization Technologies and Analyses

[0121] Immobilization of Polynucleotides on a Substrate

[0122] The polynucleotides are applied to a substrate by one of thefollowing methods. A mixture of polynucleotides is fractionated by gelelectrophoresis and transferred to a nylon membrane by capillarytransfer. Alternatively, the polynucleotides are individually ligated toa vector and inserted into bacterial host cells to form a library. Thepolynucleotides are then arranged on a substrate by one of the followingmethods. In the first method, bacterial cells containing individualclones are robotically picked and arranged on a nylon membrane. Themembrane is placed on LB agar containing selective agent (carbenicillin,kanamycin, ampicillin, or chloramphenicol depending on the vector used)and incubated at 37° C. for 16 hr. The membrane is removed from the agarand consecutively placed colony side up in 10% SDS, denaturing solution(1.5 M NaCl, 0.5 M NaOH), neutralizing solution (1.5 M NaCl, 1 MTris-HCl, pH 8.0), and twice in 2×SSC for 10 min each. The membrane isthen UV irradiated in a STRATALINKER UV-crosslinker (Stratagene).

[0123] In the second method, polynucleotides are amplified frombacterial vectors by thirty cycles of PCR using primers complementary tovector sequences flanking the insert. PCR amplification increases astarting concentration of 1-2 ng nucleic acid to-a final quantitygreater than 5 μg. Amplified nucleic acids from about 400 bp to about5000 bp in length are purified using SEPHACRYL-400 beads (APB). Purifiednucleic acids are arranged on a nylon membrane manually or using adot/slot blotting manifold and suction device and are immobilized bydenaturation, neutralization, and UV irradiation as described above.Purified nucleic acids are robotically arranged and immobilized onpolymer-coated glass slides using the procedure described in U.S. Pat.No. 5,807,522. Polymer-coated slides are prepared by cleaning glassmicroscope slides (Corning, Acton Mass.) by ultrasound in 0.1% SDS andacetone, etching in 4% hydrofluoric acid (VWR Scientific Products, WestChester Pa.), coating with 0.05% aminopropyl silane (Sigma-Aldrich) in95% ethanol, and curing in a 110° C. oven. The slides are washedextensively with distilled water between and after treatments. Thenucleic acids are arranged on the slide and then immobilized by exposingthe array to UV irradiation using a STRATALINKER Uv-crosslinker(Stratagene). Arrays are then washed at room temperature in 0.2% SDS andrinsed three times in distilled water. Non-specific binding sites areblocked by incubation of arrays in 0.2% casein in phosphate bufferedsaline (PBS; Tropix, Bedford Mass.) for 30 min at 60C.; then the arraysare washed in 0.2% SDS and rinsed in distilled water as before.

[0124] Probe Preparation for Membrane Hybridization

[0125] Hybridization probes derived from the polynucleotides of theSequence Listing are employed for screening cDNAs, mRNAs, or genomic DNAin membrane-based hybridizations. Probes are prepared by diluting thepolynucleotides to a concentration of 40-50 ng in 45 μl TE buffer,denaturing by heating to 100° C. for five min, and briefly centrifuging.The denatured polynucleotide is then added to a REDIPRIME tube (APB),gently mixed until blue color is evenly distributed, and brieflycentrifuged. Five μl of [³²P]dCTP is added to the tube, and the contentsare incubated at 37° C. for 10 min. The labeling reaction is stopped byadding 5 μl of 0.2M EDTA, and probe is purified from unincorporatednucleotides using a PROBEQUANT G-50 microcolumn (APB). The purifiedprobe is heated to 100° C. for five min, snap cooled for two min on ice,and used in membrane-based hybridizations as described below.

[0126] Probe Preparation for Polymer Coated Slide Hybridization

[0127] Hybridization probes derived from mRNA isolated from samples areemployed for screening polynucleotides of the Sequence Listing inarray-based hybridizations. Probe is prepared using the GEMbright kit(Incyte Genomics) by diluting mRNA to a concentration of 200 ng in 9 μlTE buffer and adding 5 μl 5× buffer, 1 μl 0.1 M DTT, 3 μl Cy3 or Cy5labeling mix, 1 μl RNAse inhibitor, 1 μl reverse transcriptase, and 5 μl1× yeast control mRNAs. Yeast control mRNAs are synthesized by in vitrotranscription from noncoding yeast genomic DNA (W. Lei, unpublished). Asquantitative controls, one set of control mRNAs at 0.002 ng, 0.02 ng,0.2 ng, and 2 ng are diluted into reverse transcription reaction mixtureat ratios of 1:100,000, 1:10,000, 1:1000, and 1:100 (w/w) to sample mRNArespectively. To examine mRNA differential expression patterns, a secondset of control mRNAs are diluted into reverse transcription reactionmixture at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, and 25:1 (w/w). Thereaction mixture is mixed and incubated at 37° C. for two hr. Thereaction mixture is then incubated for 20 min at 85C., and probes arepurified using two successive CHROMA SPIN+TE 30 columns (Clontech, PaloAlto Calif.). Purified probe is ethanol precipitated by diluting probeto 90 μl in DEPC-treated water, adding 2 μl 1 mg/ml glycogen, 60 μl 5 Msodium acetate, and 300 μl 100% ethanol. The probe is centrifuged for 20min at 20,800μg, and the pellet is resuspended in 12 μl resuspensionbuffer, heated to 65° C. for five min, and mixed thoroughly. The probeis heated and mixed as before and then stored on ice. Probe is used inhigh density array-based hybridizations as described below.

[0128] Membrane-Based Hybridization

[0129] Membranes are pre-hybridized in hybridization solution containing1% Sarkosyl and 1× high phosphate buffer (0.5 M NaCl, 0.1 M Na₂HPO₄, 5mM EDTA, pH 7) at 55° C. for two hr. The probe diluted in 15 ml freshhybridization solution, is then added to the membrane. The membrane ishybridized with the probe at 55° C. for 16 hr. Following hybridization,the membrane is washed for 15 min at 25° C. in 1 mM Tris (pH 8.0), 1%Sarkosyl, and four times for 15 min each at 25° C. in 1 mM Tris (pH8.0). To detect hybridization complexes, XOMAT-AR film (Eastman Kodak,Rochester N.Y.) is exposed to the membrane overnight at −70° C.,developed, and examined visually.

[0130] Polymer Coated Slide-based Hybridization

[0131] Probe is heated to 65° C. for five min, centrifuged five min at9400 rpm in a 5415C microcentrifuge (Eppendorf Scientific, WestburyN.Y.), and then 18 μl are aliquoted onto the array surface and coveredwith a coverslip. The arrays are transferred to a waterproof chamberhaving a cavity just slightly larger than a microscope slide. Thechamber is kept at 100% humidity internally by the addition of 140 μl of5×SSC in a corner of the chamber. The chamber containing the arrays isincubated for about 6.5 hr at 60C. The arrays are washed for 10 min at45° C. in 1×SSC, 0.1% SDS, and three times for 10 min each at 45° C. in0.1×SSC, and dried.

[0132] Hybridization reactions are performed in absolute or differentialhybridization formats. In the absolute hybridization format, probe fromone sample is hybridized to array elements, and signals are detectedafter hybridization complexes form. Signal strength correlates withprobe mRNA levels in the sample. In the differential hybridizationformat, differential expression of a set of genes in two biologicalsamples is analyzed. Probes from the two samples are prepared andlabeled with different labeling moieties. A mixture of the two labeledprobes is hybridized to the array elements, and signals are examinedunder conditions in which the emissions from the two different labelsare individually detectable. Elements on the array that are hybridizedto equal numbers of probes derived from both biological samples give adistinct combined fluorescence (Shalon WO95/35505).

[0133] Hybridization complexes are detected with a microscope equippedwith an INNOVA 70 mixed gas 10 W laser (Coherent, Santa Clara Calif.)capable of generating spectral lines at 488 nm for excitation of Cy3 andat 632 nm for excitation of Cy5. The excitation laser light is focusedon the array using a 20× microscope objective (Nikon, Melville N.Y.).The slide containing the array is placed on a computer-controlled X-Ystage on the microscope and raster-scanned past the objective with aresolution of 20 micrometers. In the differential hybridization format,the two fluorophores are sequentially excited by the laser. Emittedlight is split, based on wavelength, into two photomultiplier tubedetectors (PMT R1477, Hamamatsu Photonics Systems, Bridgewater N.J.)corresponding to the two fluorophores. Appropriate filters positionedbetween the array and the photomultiplier tubes are used to filter thesignals. The emission maxima of the fluorophores used are 565 nm for Cy3and 650 nm for Cy5. The sensitivity of the scans is calibrated using thesignal intensity generated by the yeast control mRNAs added to the probemix. A specific location on the array contains a complementary DNAsequence, allowing the intensity of the signal at that location to becorrelated with a weight ratio of hybridizing species of 1:100,000.

[0134] The output of the photomultiplier tube is digitized using a12-bit RTI-835H analog-to-digital (A/D) conversion board (AnalogDevices, Norwood Mass.) installed in an IBM-compatible PC computer. Thedigitized data are displayed as an image where the signal intensity ismapped using a linear 20-color transformation to a pseudocolor scaleranging from blue (low signal) to red (high signal). The data is alsoanalyzed quantitatively. Where two different fluorophores are excitedand measured simultaneously, the data are first corrected for opticalcrosstalk (due to overlapping emission spectra) between the fluorophoresusing the emission spectrum for each fluorophore. A grid is superimposedover the fluorescence signal image such that the signal from each spotis centered in each element of the grid. The fluorescence signal withineach element is then integrated to obtain a numerical valuecorresponding to the average intensity of the signal. The software usedfor signal analysis is the GEMTOOLS program (Incyte Genomics).

[0135] IX Transcript Imaging

[0136] The transcript image performed using the LIFESEQ GOLD database(Aug00rel, Incyte Genomics) allowed assessment of the relative abundanceof expressed polynucleotides in one or more cDNA libraries. Criteria fortranscript imaging include category, number of cDNAs per library,description of the library, and the like.

[0137] All sequences and cDNA libraries in the LIFESEQ database werecategorized by system, organ/tissue and cell type. The categoriesincluded cardiovascular system, connective tissue, digestive system,embryonic structures, endocrine system, exocrine glands, female and malereproductive, germ cells, hemic/immune system, liver, musculoskeletalsystem, nervous system, pancreas, respiratory system, sense organs,skin, stomatognathic system, unclassified/mixed, and the urinary tract.For each category, the number of libraries in which the sequence wasexpressed were counted and shown over the total number of libraries inthat category. In some transcript images, all normalized or pooledlibraries, which have high copy number sequences removed prior toprocessing, and all mixed or pooled tissues, which are considerednon-specific in that they contain more than one tissue type or more thanone subject's tissue, can be excluded from the analysis. Cell linesand/or fetal tissue data can also be disregarded unless the elucidationof inherited disorders would be furthered by their inclusion in theanalysis.

[0138] For diagnostic purposes, the standards to which biopsied sampleswould be compared are: cytologically normal, non-diseased samples versussamples which had been diagnosed with specific cardiac disordersincluding, but not limited to, atherosclerosis, arteriosclerosis, atrialfibrillation, cancer (myxoma) and complications of cancer, cardiacinjury, congestive heart failure, coronary artery disease, hypertension,hypertrophic cardiomyopathy, myocardial hypertrophy, myocardialinfarction, and plaque.

[0139] For purposes of example, the transcript images for SEQ ID NOs: 29and 44 are shown below. The first column shows library name; the secondcolumn, the number of cDNAs sequenced in that library; the third column,the description of the library; and the fourth column, absoluteabundance of the transcript in the library. Library cDNA DescriptionAbundance % Abundance SEQ ID NO:29 (Category: Cardiovascular*) HEARNOT063685 heart, hypertension, 44M 2 0.0543 HEARFET05 2524 heart, fetal, M 10.0396 HEARFET02 6919 heart, hypoplastic left, fetal, 23wM 1 0.0145 *Nolibraries were removed from the analysis. SEQ ID NO:44 (Category:Cardiovascular*) HEALDIT02 4171 left ventricle, mw/myocardialinfarction, 56M 1 0.0240 HEARFET02 6919 heart, hypoplastic left, fetal,23wM 1 0.0145 *Normalized and pooled libraries were removed from theanalysis.

[0140] SEQ ID NOs: 29 and 44 were differentially expressed when comparedby percent abundance to useful standards (i.e., the up-regulation of SEQID NOs: 29 in heart tissue of a deceased victim who was shot to death isnot a comparison that would be made in a diagnostic setting). Moreimportantly, these sequences are not differentially expressed in anynormal tissue or diagnostic of any other cardiac disorder.

[0141] The differential expression of SEQ ID NOs: 29, and 44,respectively, in tissue associated with hypertension and myocardialinfarction, respectively, supports the use of the sequences as asurrogate markers for sarcomeric mitochondrial creatine kinase andcardiodilantin, respectively. These transcript images verify GBAanalysis (see Example VI above).

[0142] X Complementary Molecules

[0143] The complement of the novel polynucleotide, from about 5 bp(e.g., a PNA) to about 5000 bp (e.g., the complement of a cDNA insert),are used to detect or inhibit gene expression. These molecules areselected using LASERGENE software (DNASTAR). Detection is described inExample VIII. To inhibit transcription by preventing promoter binding,the complementary molecule is designed to bind to the most unique 5′sequence and includes nucleotides of the 5′ UTR upstream of theinitiation codon of the open reading frame. Complementary moleculesinclude genomic sequences (such as enhancers or introns) and are used in“triple helix” base pairing to compromise the ability of the doublehelix to open sufficiently for the binding of polymerases, transcriptionfactors, or regulatory molecules. To inhibit translation, acomplementary molecule is designed to prevent ribosomal binding to themRNA encoding the protein.

[0144] Complementary molecules are placed in expression vectors and usedto transform a cell line to test efficacy; into an organ, tumor,synovial cavity, or the vascular system for transient or short termtherapy; or into a stem cell, zygote, or other reproducing lineage forlong term or stable gene therapy. Transient expression lasts for a monthor more with a non-replicating vector and for three months or more ifappropriate elements for inducing vector replication are used in thetransformation/expression system.

[0145] Stable transformation of appropriate dividing cells with a vectorencoding the complementary molecule produces a transgenic cell line,tissue, or organism (U.S. Pat. No. 4,736,866). Those cells thatassimilate and replicate sufficient quantities of the vector to allowstable integration also produce enough complementary molecules tocompromise or entirely eliminate activity of the polynucleotide encodingthe protein.

[0146] XI Protein Expression

[0147] Expression and purification of the protein are achieved usingeither a cell expression system or an insect cell expression system. ThepUB6NV5-His vector system (Invitrogen, Carlsbad Calif.) is used toexpress protein in CHO cells. The vector contains the selectable bsdgene, multiple cloning sites, the promoter/enhancer sequence from thehuman ubiquitin C gene, a C-terminal V5 epitope for antibody detectionwith anti-V5 antibodies, and a C-terminal polyhistidine (6×His) sequencefor rapid purification on PROBOND resin (Invitrogen). Transformed cellsare selected on media containing blasticidin.

[0148]Spodoptera frugiperda (Sf9) insect cells are infected withrecombinant Autographica californica nuclear polyhedrosis virus(baculovirus). The polyhedrin gene is replaced with the polynucleotideby homologous recombination and the polyhedrin promoter drivestranscription. The protein is synthesized as a fusion protein with 6×hiswhich enables purification as described above. Purified protein is usedin the following activity and to make antibodies.

[0149] XII Production of Antibodies

[0150] The protein is purified using polyacrylamide gel electrophoresisand used to immunize mice or rabbits. Antibodies are produced using theprotocols below. Alternatively, the amino acid sequence of the expressedprotein is analyzed using LASERGENE software (DNASTAR) to determineregions of high antigenicity. An antigenic epitope, usually found nearthe C-terminus or in a hydrophilic region is selected, synthesized, andused to raise antibodies. Typically, epitopes of about 15 residues inlength are produced using an ABI 431A peptide synthesizer (ABI) usingFMOC-chemistry and coupled to KLH (Sigma-Aldrich) by reaction withN-maleimidobenzoyl-N-hydroxysuccinimide ester to increase antigenicity.

[0151] Rabbits are immunized with the epitope-KLH complex in completeFreund's adjuvant. Immunizations are repeated at intervals thereafter inincomplete Freund's adjuvant. After a minimum of seven weeks for mouseor twelve weeks for rabbit, antisera are drawn and tested forantipeptide activity. Testing involves binding the peptide to plastic,blocking with 1% bovine serum albumin, reacting with rabbit antisera,washing, and reacting with radio-iodinated goat anti-rabbit IgG. Methodswell known in the art are used to determine antibody titer and theamount of complex formation.

[0152] XIII Purification of Naturally Occuring Protein Using SpecificAntibodies

[0153] Naturally occurring or recombinant protein is purified byimmunoaffinity chromatography using antibodies which specifically bindthe protein. An immunoaffinity column is constructed by covalentlycoupling the antibody to CNBr-activated SEPHAROSE resin (APB). Mediacontaining the protein is passed over the immunoaffinity column, and thecolumn is washed using high ionic strength buffers in the presence ofdetergent to allow preferential absorbance of the protein. Aftercoupling, the protein is eluted from the column using a buffer of pH 2-3or a high concentration of urea or thiocyanate ion to disruptantibody/protein binding, and the protein is collected.

[0154] XIV Screening Molecules for Specific Binding Using Polynucleotideor Protein

[0155] The polynucleotide, or fragments thereof, or the protein, orportions thereof, are labeled with ³²P-dCTP, Cy3-dCTP, or Cy5-dCTP(APB), or with BIODIPY or FITC (Molecular Probes, Eugene Oreg.),respectively. Libraries of candidate molecules or compounds previouslyarranged on a substrate are incubated in the presence of composition, alabeled polynucleotide or protein. After incubation under conditions foreither a nucleic acid or amino acid sequence, the substrate is washed,and any position on the substrate retaining label, which indicatesspecific binding or complex formation, is assayed, and the ligand isidentified. Data obtained using different concentrations of the nucleicacid or protein are used to calculate affinity between the labelednucleic acid or protein and the bound molecule.

[0156] XV Two-Hybrid Screen

[0157] A yeast two-hybrid system, MATCHMAKER LexA Two-Hybrid system(Clontech Laboratories, Palo Alto Calif.), is used to screen forpeptides that bind the protein of the invention. A polynucleotideencoding the protein is inserted into the multiple cloning site of apLexA vector, ligated, and transformed into E. coli. cDNA, prepared frommRNA, is inserted into the multiple cloning site of a pB42AD vector,ligated, and transformed into E. coli to construct a cDNA library. ThepLexA plasmid and pB42AD-cDNA library constructs are isolated from E.coli and used in a 2:1 ratio to co-transform competent yeastEGY48[p8op-lacZ] cells using a polyethylene glycol/lithium acetateprotocol. Transformed yeast cells are plated on synthetic dropout (SD)media lacking histidine (-His), tryptophan (-Trp), and uracil (-Ura),and incubated at 30° C. until the colonies have grown up and arecounted. The colonies are pooled in a minimal volume of 1× TE (pH 7.5),replated on SD/-His/-Leu/-Tip/-Ura media supplemented with 2% galactose(Gal), 1% raffiniose (Raf), and 80 mig/mill5-bi-roio-4-chloro-3-indolyl-β-d-galactopyranoside (X-Gal), andsubsequently examined for growth of blue colonies. Interaction betweenexpressed protein and cDNA fusion proteins activates expression of aLEU2 reporter gene in EGY48 and produces colony growth on media lackingleucine (-Leu). Interaction also activates expression of β-galactosidasefrom the p8op-lacZ reporter construct that produces blue color incolonies grown on X-Gal.

[0158] Positive interactions between expressed protein and cDNA fusionproteins are verified by isolating individual positive colonies andgrowing them in SD/-Trp/-Ura liquid medium for 1 to 2 days at 30C. Asample of the culture is plated on SD/-Trp/-Ura media and incubated at30° C. until colonies appear. The sample is replica-plated onSD/-Trp/-Ura and SD/-His/-Trp/-Ura plates. Colonies that grow on SDcontaining histidine but not on media lacking histidine have lost thepLexA plasmid. Histidine-requiring colonies are grown onSD/Gal/Raf/X-Gal/-Trp/-Ura, and white colonies are isolated andpropagated. The pB42AD-cDNA plasmid, which contains a polynucleotideencoding a protein that physically interacts with the protein, isisolated from the yeast cells and characterized.

[0159] All patents and publications mentioned in the specification areincorporated by reference herein. Various modifications and variationsof the described method and system of the invention will be apparent tothose skilled in the art without departing from the scope and spirit ofthe invention. Although the invention has been described in connectionwith specific preferred embodiments, it should be understood that theinvention as claimed should not be unduly limited to such specificembodiments. Indeed, various modifications of the described modes forcarrying out the invention that are obvious to those skilled in thefield of, molecular biology or related fields are intended to be withinthe scope of the following claims.

1 62 1 790 DNA Homo sapiens misc_feature Incyte ID No 2045674CT1 1ctgttgctcg agcccttagc aatatatacg taaacatatc cagcttgtct aacacatcac 60agattattag ttaacaaggt gtagattaat gagcttatat tgtattgctg gatcttttga 120gttaataaca atggtaactt gtccagaagg cctatcatca ttcctagtag gtgggcacag 180agtaagagat attaagaagc ttcctgatga gtcatcatct agcgaaggcc ctgtgtaggg 240ctttattata ggagttacat tgacttctgg ggcattcaaa ggtctcccct cttatccata 300tctctgtcat tttgcccacc tactaggaat gatgataggc tttaataaca atggtaactt 360gtccagaagg cctatcatca ttcctagtag gtgggcacag agtaagagat attaagaagc 420ttcctgatga gtcatcatct agcgaaggcc ctgtgtaggg ctatgttata ggagttacat 480tgacttctgg ggcattcaaa ggtctcccct cttatccata tctctgtcat tttgcttctc 540cagccacgac aacacacttt cctctccaac tgctccctcc ccaccaaaaa agaagaccct 600ctaaaaggca aaggaataaa tattcttaga agtaaagtat cttcatacat gctgcctttt 660tcaaagaggt gttaggatat ttatcctatt tctgtatttc acagtagctt ttcaggctgt 720cctgcttatg tataagctga tttctcgtgc cgaattcttg cctcgagggc caaattccct 780atatgatcgt 790 2 459 DNA Homo sapiens misc_feature Incyte ID No188552CT1 2 ggcacgagct gacatgagtc tcagtgccgg caaacacggc tggttgaaccctgagctagc 60 ccagctgctt tgttcacctt acgtttgggg aaggctgaaa ttttattgagcaccgactgt 120 attccacaca ctcttctagg tgcccgaaat atgctgttaa acaaatactcagccctcatg 180 gggctgagag tctggtgggg aagacctgtt gaaaaacaat catattaaatgaattgcatt 240 gcatgttaga agatcgtaag tactctgggg gaaaatgaga gtagaacaggataagggggt 300 gatggaggga atgagtggtg attttaaatg tagttatcag gctgggcacaatggcttaca 360 cctgtaatcc cagcattttg gaaggccaag acgggcaggt cacttgaagtcaggagtttg 420 agaccagcct ggccaacatg gtgaaaacct gtctctact 459 3 517 DNAHomo sapiens misc_feature Incyte ID No 465676CT1 3 gtggccagag ccagccagcatggccaccct caagaggcga gatgagccca cagaggcata 60 tcctgcgggg atgctgggctcccagtgtgg ttggcctgaa caaaataaag tgttgactcc 120 tgggcatctg tgccttctctatggccttgc tacctgggat tccagagagt tgatggggtg 180 cagatagggg taggactgttagaatagaac caacccaaac tgtgtgtagt ttggggtgta 240 tacttctatt tctcttcctacatgtctaca tgccatgacc ttcctcctcc tcttcacttg 300 gccagtttca gctcacttcctccaggaagt ctttcctgat atatcaaact gaaacaaatg 360 ctcctcctcc atgctcccttaatccccatg cttgtcgatt atattccttt gccaattcat 420 ttctctatcc tgtctatgtataagtgtgta caagcattca agaaactgat gaatgatgaa 480 tgaatgaatg agccaaagaacaaataaatg agcccct 517 4 824 DNA Homo sapiens misc_feature Incyte ID No3601719CB1 4 gtttaagttc ccctccagcc ccgagccagg agcagttctc aataccgggagaggcacaga 60 gctatttcag ccacatgaaa agcatcggaa ttgagatcgc agctcagaggacaccgggcg 120 ccccttccac cttccaagga gctttgtatt cttgcatctg gctgcctgggacttccctta 180 ggcagtaaac aaatacataa agcagggata agactgcatg aatatgtcgaaacagccagt 240 ttccaatgtt agagccatcc aggcaaatat caatattcca atgggagcctttcggccagg 300 agcaggtcaa ccccccagaa gaaaagaatg tactcctgaa gtggaggagggtgttcctcc 360 cacctcggat gaggagaaga agccaattcc aggagcgaag aaacttccaggacctgcagt 420 caatctatcg gaaatccaga atattaaaag tgaactaaaa tatgtccccaaagctgaaca 480 gtagtaggaa gaaaaaagga ttgatgtgaa gaaataaaga ggcagaagatggattcaata 540 gctcactaaa attttatata tttgtatgat gattgtgaac ctcctgaatgcctgagactc 600 tagcagaaat ggcctgtttg tacatttata tctcttcctt ctagttggctgtatttctta 660 ctttatcttc atttttggca cctcacagaa caaattagcc cataaattcaacacctggag 720 ggtgtggttt tgaggaggga tatgatttta tggagaatga tatggcaatgtgcctaacga 780 ttttgatgaa aagtttccca agctacttcc tacagtattt tggt 824 5969 DNA Homo sapiens misc_feature Incyte ID No 305781CT1 5 cccttttttttttttttttt tttttttttt tttttttttt ttttttggga gtatagatta 60 tgtttattttctctataatt tccagggttt tccaaaattt tacaacaaac atctataatt 120 ttatgaacactccccatctt atttttaaaa agaaaaaagt tggggggcag agaaatgccc 180 agctcagtactgagatccat caagtgaggc cagccggtat ctgtcacacc aggcagaggc 240 cccgtgctggaagccctgga ggttaccagc tccaagcctg gtatccaagg cctcctgggc 300 agccttagcctcctccttcc ctttcctccc accagaccct gctcctggga tgtccttctc 360 ccattaccaccacaaaatcg gactaatttt tcagggccca acaccaattc tgctaatttt 420 tttttggctcaatcttggct catcacaacc tccgcctccc aggttcaacg gattctccca 480 cctcggccttctgaatagcc gggattacag gcacctgcca ccacgcctgg ctaatttttg 540 tatttttagtagagaccggg ttttgccatg ttggccacgc tggtctccaa ctcctgacct 600 caggtgatctgcccgccttg gcctcccaat ctccttccat ttattagttg gattgcttaa 660 aaaaaaaaaagactccccga tatgggcagg agcaatgctg attttttact tacctgtctc 720 tagataatgaattgattgtt agcctccaaa gatgatcaat ttgtttttgt ttttgttttt 780 gtttcagattacggtgaact catggactta aacttcttta tgggttttga gccactgcaa 840 ttatcctcaccaaatctcaa gctgtcccac ctctggcacg tggggcctct tcaagttttc 900 ctcattcatatttgtttgtc tgtttgttgt ttttgggtgg ccagcaggag agcatccaca 960 gtctgtctc 9696 597 DNA Homo sapiens misc_feature Incyte ID No 971441CT1 6 aagaggtaagcgtggcctga cctagccacc caccaacagg aataatggct gaaaaagcgg 60 ggtctacattttcacacctt ctggttccta ttcttctcct gattggctgg attgtgggct 120 gcatcataatgatttatgtt gtcttctctt agaaaggcaa gaagatatca gattgacatc 180 atttagaagaattaagaaaa ctatgaacat gactgattat taaatgtctc atgttaaaca 240 atgcaatgtttgacatcact ttacaaactt ggatcataaa ctggcacttt ggtatgcata 300 agaatttcttcaggacaata agaaattatg agtgaatttc tctatattct gagtgagaaa 360 aatgtttagctgtgatgaaa aatgcatgtc attaaaaaaa gtttgataaa tttaatcaca 420 ttacaaaaaattatcccccc ttccctctgg aaaaaactat agagaaagtg ggctgaggct 480 gtgcaaggtggctcatgcct gtaatcccag cactttgtga ggatcctttg agcccagaaa 540 ttggagaccttcctaggcga cagagagaga ccccatctct acaaaaaaaa aaaaaaa 597 7 1918 DNA Homosapiens misc_feature Incyte ID No 3445829CB1 7 cagcctgcca cttgcctccctgcctgcttc tggctgcctt gaatgcctgg tccttcaagc 60 tccttctggg tctgacaaagcagggaccat gtctaccttt ggctaccgaa gaggactcag 120 taaatacgaa tccatcgacgaggatgaact cctcgcctcc ctgtcagccg aggagctgaa 180 ggagctagag agagagttggaagacattga acctgaccgc aaccttcccg tggggctaag 240 gcaaaagagc ctgacagagaaaacccccac agggacattc agcagagagg cactgatggc 300 ctattgggaa aaggagtcccaaaaactctt ggagaaggag aggctggggg aatgtggaaa 360 ggttgcagaa gacaaagaggaaagtgagga agagcttatc tttactgaaa gtaacagtga 420 ggtttctgag gaagtgtatacagaggagga ggaggaggag tcccaggagg aagaggagga 480 agaagacagt gacgaagaggaaagaacaat tgaaactgca aaagggatta atggaactgt 540 aaattatgat agtgtcaattctgacaactc taagccaaag atatttaaaa gtcaaataga 600 gaacataaat ttgaccaatggcagcaatgg gaggaacaca gagtccccag ctgccattca 660 cccttgtgga aatcctacagtgattgagga cgctttggac aagattaaaa gcaatgaccc 720 tgacaccaca gaagtcaatttgaacaacat tgagaacatc acaacacaga cccttacccg 780 ctttgctgaa gccctcaaggacaacactgt ggtgaagacg ttcagtctgg ccaacacgca 840 tgccgacgac agtgcagccatggccattgc agagatgctc aaagtcaatg agcacatcac 900 caacgtaaac gtcgagtccaacttcataac gggaaagggg atcctggcca tcatgagagc 960 tctccagcac aacacggtgctcacggagct gcgtttccat aaccagaggc acatcatggg 1020 cagccaggtg gaaatggagattgtcaagct gctgaaggag aacacgacgc tgctgaggct 1080 gggataccat tttgaactcccaggaccaag aatgagcatg acgagcattt tgacaagaaa 1140 tatggataaa cagaggcaaaaacgtttgca ggagcaaaaa cagcaggagg gatacgatgg 1200 aggacccaat cttaggaccaaagtctggca aagaggaaca cctagctctt caccttatgt 1260 atctcccagg cactcaccctggtcatcccc aaaactcccc aaaaaagtcc agactgtgag 1320 gagccgtcct ctgtctcctgtggccacacc tcctcctcct ccccctcctc ctcctcctcc 1380 ccctccttct tcccaaaggctgccaccacc tcctcctcct ccccctcctc cactcccaga 1440 gaaaaagctc attaccagaaacattgcaga agtcatcaaa caacaggaga gtgcccaacg 1500 ggcattacaa aatggacaaaaaaagaaaaa agggaaaaag gtcaagaaac agccaaacag 1560 tattctaaag gaaataaaaaattctctgag gtcagtgcaa gagaagaaaa tggaagacag 1620 ttcccgacct tctaccccacagagatcagc tcatgagaat ctcatggaag caattcgggg 1680 aagcagcata aaacagctaaagcgggtaag taaccagaga acagacatag gggcacagat 1740 aaagtaaatg agttgtcctccattgcatgg tggtaccaaa gtcacctctc acaatactta 1800 tcaatacttt caatattttagtatgcgaga gcaaacacac caagtttgaa acattaggag 1860 caggcacaca agtgagcacatttctatttg agaggaacgc ctgggccgct ttcccagg 1918 8 1079 DNA Homo sapiensmisc_feature Incyte ID No 189299CT1 8 gtcaagctct acctgagcga caaccacctcaatagcctgc ctccggagct ggggcagcta 60 cagaacctgc agattctggc cttggatttcaacaacttca aggctctgcc ccaggtggtg 120 tgcaccttga aacagctctg catcctctacctgggcaaca acaaactctg cgacctcccc 180 agtgagctga gcctgctcca gaacctcaggaccctgtgga tcgaggccaa ctgcctcacc 240 cagctgccgg atgtggtctg tgagctgagtctccttaaga ctctgcatgc cggctccaac 300 gccctgcgtt tgctgccagg ccagctccggcgcctccagg agctgaggac catctggctc 360 tcgggcaacc ggctaactga ctttcccactgtgctgcttc acatgccctt cctggaggtg 420 attgatgtgg actggaacag catccgttacttccccagcc tggcgcacct gtcaagtctg 480 aagctggtca tctatgacca caatccttgcaggaacgcac ccaaggtggc caaaggtgtg 540 cgccgtgtgg ggagatgggc agaggagacgccagagcccg accctagaaa agccaggcgc 600 tatgcgttgg tcagagagga aagccaggagctacaggcac cagtccctct acttcctcct 660 accaactcct gaggagcttc agttgcaagtcaatgccaag gacccaactg cagcatgttc 720 tggaagcctc tccattggag tggaaaggatggctctgggt catttgggag tggctctgct 780 agtagagact gatggagaga gccaggtggaatgccataaa tcacactgag aaaatatttc 840 tggcaaacag ctcctctttc agaggggagttgtgtgccca atgatggcat gacaaatcca 900 gagatcataa cttcctttgc gaagaagaacagctcgtcca cagcattgta tttttggaga 960 cacttgaaag agccaaaaga ggggcttgggaaacatcctg aaacctccct ggaagtctct 1020 caggaaattt gacttgggca ttggaggctccattgggctc cttccaatta aggggtgtt 1079 9 1028 DNA Homo sapiensmisc_feature Incyte ID No 2396760CT1 9 gtactgactc actataggga atttgcgcctcgaggcaaga attcggcacg aggggctgtt 60 accaggacaa ccggagcgat tgaccgttatctgcggtttg gagccgttag cgggagaggc 120 agagatattc agaggtcttt taggatgtgctaaagggtcg tgagggctct cttaaaattt 180 tcttcacaag cggttatcca gtcgtgccccgcggccctgc tgctggcccc ggggatctga 240 gtcgtaccct cttgtttttc tctgagtcagtcttaaggtg aaatgaagtg tggcccagtg 300 gctcctcact gtcgcttctc tagttttctgcctcctttta gaaaattgaa ttgaaaagac 360 aggatgaagt ggacacagca tgtgaagacaattctttcaa gaagtttggc tgtcaaggaa 420 aacagagaat gtgctaaaga acatacagacacagagcaga caggccacct ttgcaaccac 480 atggaggttt gtctgatatt gaagctaaagaagctaagct ggaagacaga gagaccaagt 540 cctgatgaca ttgtttgaac ccagagatccagacatgcct gaaaactagt tttaccactg 600 gacttatccg ttgaatgagc caataaactctcttttatac ttaaccttgg gttttacctg 660 gatttttgtc attgacagct caaaatattctaatatagaa gtatacatca ttaaatcaac 720 atttcttttt ttctctgtct tattttaaatgtaactctat aaggtactct aaaagtattc 780 tacagtctca ctaagttaat ctgcaaatttggtaaaattc caatattaat cccaaaagta 840 ttttaagagc ttgtttttgt tgtttgcttgtttgggacta aacagaatta ctccaaaatt 900 cattgagaga aaaaaaaaac atgaaaaaaaaaacaagaaa atagaattca taaaaggaaa 960 ttgtattata taacaaagca taaaacaagaataataaaca tagagtggta atggaataaa 1020 tagaacac 1028 10 1149 DNA Homosapiens misc_feature Incyte ID No 919893CT1 10 tcgttctcac tgagcacgatattaggctct ctcccaactc actctattct gtcctcactc 60 ctgttttgat ttttctcttgccatgtttga aatgttttat gggaatgtat tagaactctt 120 ttcttctaag gactgagacttccaggggat tgccatctta cctgtctctt ctccatgagg 180 gagaaggaag cagctagctatgtccctagc tgcaggaagc ccctattttt tccaagcacg 240 aagccaccag tctcccccagggagcatcag gaagggacat ggatgtgctc ctgccacagg 300 gcccttccta cctttggatctgtgagaagg tgaatacaaa gcagcaggca gagtaaaatc 360 tgctgggact gcctggagatttgtcaggag ctgcagacaa gtaccttgga gcattctgtt 420 atttttggaa agttcaaatatgcagggaca aggaggttgc tgactgtact gacaggctct 480 aagtcatttt ctccaaaaactatctattca attatcaggg gctggtcttg aggaaggaaa 540 aaaaaaaaaa acgttcccagaattcagttt ccaaaatctc tttttaaagg gtttacacac 600 acacacacac acacacacacacacacacac acacacacac gatcattaaa aagtgtatgc 660 tctttaagaa gaaaagtaaaatatctcaaa ggacggtttc accaccgtcc tttattgaat 720 caatttttct acatttcagagcaagtgtag attctgaggg actcctattt gccaaaaaga 780 caaaactagc aaaaaaaaaaacaaaaaaac aaaaaaaaaa ccacttaaaa ggtagcagga 840 aaagaaggta gttttgagtgtggttcactc agtgtctgtg agtctggtgt agtgtcagga 900 gtaaggccgt gtctagctcaagtttacatt tggatgtcct acaacactaa acaaaatttt 960 tcataatcca tggtggggagcacactttgg agctacattt cttgtctcct cattgttgac 1020 attaattaaa catttataggccaggcacag tggctcacgc ctgttatccc agcactttgg 1080 gaggccgagg caggtgaatcacctgaggtc aggagtttga aaccagcctg gccaatatgg 1140 tgaaaccca 1149 11 1467DNA Homo sapiens misc_feature Incyte ID No 2837330CB1 11 ctaaggcttatagattgcca gcctgctcag cgtctctaac ccttttcagg tctctgctgg 60 tggttctgaagccaaacctc tgatcttcac atttgtcccc actgtcagaa gactaccaac 120 ccatactcagttggctgaca cctctaaatt ccttgttaaa attccagaag aatcaagtga 180 taagagtccagaaactgtaa ataggtctaa atccaatgac tacttgacct tgaatgctgg 240 gagccaacaagagagagacc aagcgaaatt gacttgtcct tcagaggtca gtggaacgat 300 tttacaagaaagggaattcg aagcaaacaa acttcaaggg atgcagcaaa gtgacctctt 360 caaagctgaatatgtcctta ttgtggactc cgaaggggaa gatgaggctg caagcagaaa 420 agttgaacaaggccccccag gggggaattg gcaccgcagc tgtccggccc aagtctctag 480 ctatctcgtccagtctggtc tctgatgtag tgcgtcccaa aacacagggg actgatctca 540 agacctcatcacatcctgaa atgcttcatg ggatggcccc tcagcaaaag catgggcagc 600 aatacaagaccaagtcaagc tacaaggctt ttgcagcatt ccctacaaac acattgcttt 660 tggaacagaagactcctaca actcttccaa gagcagctgg tcgagaaacc aaatatgcaa 720 atctctcctcaccaacttct acagtatctg agagtcagct gactaagcct ggagtaattc 780 gcccagtacctgtaaaatcc agaatattac tgaaaaaaga ggaggaagtc tatgaaccca 840 accctttcagtaaatacttg gaagataaca gcgacctctt ttctgaacag gatgtaacag 900 tccctcccaagcctgtctcg ctccatcctt tatatcagac taaactctat cctcctgcta 960 agtcactgctgcatccacag accctctcac atgctgactg tcttgcccca ggacccttca 1020 gtcatctgtccttctccttg agtgatgaac aggagaattc tcacaccctc ctcagtcaca 1080 acgcatgcaacaagctgagt catccaatgg tggctattcc tgaacatgaa gctcttgatt 1140 ccaaagagcaatgaagttgg agcagaggct gaaaacacag gctgctgaag ttttttggaa 1200 tgctggtgctaaccacttgc tagatttaac tttttttttt ttttccagaa tgagtgctcc 1260 ctttatgagtgcagtgcagc agaaccaaaa aaaaagtttg ctgcaattat atagcatcac 1320 agtgctctgctaacagccag catagaagag atttacctac agctttttgc accactgttc 1380 tagcctttaatgccttctac ttaatattaa gctgaccgca atactaacgt gcccctatat 1440 ttggcagccaaataaagaag aatcgtg 1467 12 1691 DNA Homo sapiens misc_feature Incyte IDNo 1737459CB1 12 cggctcgagg agaaagaggt ttttaaattc tccatgaagt gtactatgttccatcattcc 60 ttcccaaagc caccggaagc attccttcta ggaaaggtgg agtcggtagtgagaagccgg 120 aggtgagaag acccctgagc ggatggattc attcattttc tgaatttcctatgtgaggac 180 agtattagag cccagtgagg ctttgagagg ccccaaagat gagcgccaacagtagcagag 240 tgggccagct tctcttgcag ggttcagcgt gcattaggtg gaagcaggatgtggaagggg 300 ctatctacca cctagccaac tgcctcttac tcctgggctt catggggggcagtggggtgt 360 atggatgctt ctatcttttt ggcttcctga gtgcaggtta cctgtgctgcgtgctgtggg 420 gctggttcag tgcctgtggc ctggacattg ttctttggag cttcctgctggctgtggtct 480 gcctgctcca gctggcacac ctggtatacc gtctgcgtga ggacaccctccctgaggagt 540 ttgacctcct ctacaagacg ctgtgcctgc ccttgcaggt gcccctacagacatacaagg 600 agattgttca ctgctgtgag gagcaggtct taactctggc cactgaacagacctatgctg 660 tggagggtga gacacccatc aaccgcctgt ccctgctgct ctctggccgggttcgtgtga 720 gccaggatgg gcagtttctg cactacatct ttccatacca gttcatggactctcctgagt 780 gggaatcact acagccttct gaggaggggg tgttccaggt cactctgactgctgagacct 840 catgtagcta catttcctgg ccccggaaaa gtctccatct tcttctgaccaaagagcgat 900 acatctcctg cctcttctcg gctctgctgg gatatgacat ctcggagaagctctacactc 960 tcaatgacaa gctctttgct aagtttgggc tgcgctttga catccgccttcccagcctct 1020 accatgtcct gggtcccact gctgcagatg ctggaccaga gtccgagaagggtgatgagg 1080 aagtctgtga gccagctgtg tcccctcctc aggccacacc cacctctctccagcaaacac 1140 ccccttgttc tacccctcca gctaccacca actttcctgc acctcctacccgggccaggt 1200 tgtccaggcc agacagtggc atactggctt ctagaattcc tctccagagctactctcaag 1260 ttatatccag gggacaggcc cctttggctc caacccacac gcctgaactttaaggatcat 1320 tggactatct tctctgtggc cagcgcagct ctcttctgtg ttcacagaatggccactgat 1380 aggcacgcct cttttcccac ccactggaag gctcacaggc aaggtgagagaggacacaga 1440 aggtgccaac actgtcgcta cagtaaggac ctgaagtgac tttgagaaattcaccctcac 1500 aaaccttcct tcaggagcag gcattggtag tgcagaggca cagattccgtcctttaccag 1560 ctgcagaatc ttgggcaagt tacatagcct ctgtgagcct catcggtaaacagtgggggt 1620 tatgaaaccc acctcacagg gttgttgtga ggatccaatg agttgatttaggtaagcacc 1680 tagcacatgc c 1691 13 2379 DNA Homo sapiens misc_featureIncyte ID No 058201CB1 13 cccaggatct gctctgaaac caggtctcta agtgaacatttctcaggcat ggatgcattt 60 gagagtcaaa ttgttgagtc gaagatgaaa acctcttcatcacatagctc agaagctggc 120 aaatctggct gtgacttcaa gcatgcccca ccaacctatgaggatgtcat tgctggacat 180 attttagata tctctgattc acctaaagaa gtaagaaaaaattttcaaaa gacgtggcaa 240 gagagtggaa gagtttttaa aggcctggga tatgcaaccgcagatgcttc tgcaacatga 300 gatgagaacc accttccaag aggaatctgc atttataagtgaagctgctg ctccaagaca 360 aggaaatatg tatactttgt caaaagacag tttatccaatggagtgccta gtggcagaca 420 agcagaattt tcataagtcc tgcttccgat gccaccattgcaacagtaaa ctaagtttgg 480 gaaattatgc atcacttcat ggacaaatat actgtaaacctcactttaaa caacttttca 540 aatccaaagg aaattatgat gaaggttttg gacataagcagcataaagat agatggaact 600 gcaaaaacca aagcagatca gtggacttta ttcctaatgaagaaccaaat atgtgtaaaa 660 atattgcaga aaacaccctt gtacctggag atcgtaatgaacatttagat gctggtaaca 720 gtgaagggca aaggaatgat ttgagaaaat taggggaaaggggaaaatta aaagtcattt 780 ggcctccttc caaggagatc cctaagaaaa ccttaccctttgaggaagag ctcaaaatga 840 gtaaacctaa gtggccacct gaaatgacaa ccctgctatcccctgaattt aaaagtgaat 900 ctctgctaga agatgttaga actccagaaa ataaaggacaaagacaagat cactttccat 960 ttttgcagcc ttatctacag tccacccatg tttgtcagaaagaggatgtt ataggaatca 1020 aagaaatgaa aatgcctgaa ggaagaaaag atgaaaagaaggaaggaagg aagaatgtgc 1080 aagataggcc gagtgaagct gaagacacaa agagtaacaggaaaagtgct atggatctta 1140 atgacaacaa taatgtgatt gtgcagagtg ctgaaaaggagaaaaatgaa aaaactaacc 1200 aaactaatgg tgcagaagtt ttacaggtta ctaacactgatgatgagatg atgccagaaa 1260 atcataaaga aaatttgaat aagaataata ataacaattatgtagcagtc tcatatctga 1320 ataattgcag gcagaagaca tctattttag aatttcttgatctattaccc ttgtcgagtg 1380 aagcaaatga cactgcaaat gaatatgaaa ttgagaagttagaaaataca tctagaatct 1440 cagagttact tggtatattt gaatctgaaa agacttattcgaggaatgta ctagcaatgg 1500 ctctgaagaa acagactgac agagcagctg ctggcagtcctgtgcagcct gctccaaaac 1560 caagcctcag cagaggcctt atggtaaagg ggggaagttcaatcatctct cctgatacaa 1620 atctcttaaa cattaaagga agccattcaa agagcaaaaatttacacttt ttcttttcta 1680 acaccgtgaa aatcactgca ttttccaaga aaaatgagaacattttcaat tgtgatttaa 1740 tagattctgt agatcaaatt aaaaatatgc catgcttggatttaagggaa tttggaaagg 1800 atgttaaacc ttggcatgtt gaaacaacag aagctgcccgcaataatgaa aacacaggtt 1860 ttgatgctct gagccatgaa tgtacagcta agcctttgtttcccagagtg gaggtgcagt 1920 cagaacaact cacggtggaa gagcagatta aaagaaacaggtgctacagt gacactgagt 1980 aaaatatcta tggccactga cagtccacac ttaggcactgagagatattg atgttctgaa 2040 ataagatttt atgaatttgg ataccctttt gaggaacttgatgtaaacat ggtgttcaga 2100 aatctcgtgt ctatctcaat gggatatttc ttgtattacaccttgtcatt tttttcacaa 2160 tttatttaca tctacttttg tttgaactgg aatgaagagatgaaacacta tggatatgtt 2220 ttccattcaa atggcacttt agcatattgt tctgttttcctgtaaaacat catgggtgtg 2280 atttttatac tgctgctgct tgtcacaatt attataacttctctgtaatt tcctctgaaa 2340 taaaattgaa tcacctgagg tgcaaaccaa aaaaaaaaa2379 14 1904 DNA Homo sapiens misc_feature Incyte ID No 767447CT1 14atgaatacaa atcgctcaga aagcattttg gtggcacaga aaggggatgt atttgtgttg 60agatcttatt ttattttgta tttatttatc ttctttgact tgcacagcac tattgggggt 120gggggaagca gggtagtggg agacgaaggc agaagcaaga gtcaaactca gaatgactga 180gttgaattca ctgtctagtc agcaatgcct gcttctgagt ttggcccaga gagaaggtat 240tgagtaagat tttaataact gtaaaaagta agctggataa gtaaaatcat gatggatcca 300aagcacagtt tcttcatctc ctgataaaga aagtcaaatg cttgataaat tcagagtcac 360agatgtgagc atagctatat tcttttaaac gagaggtaga gtgacctagc actaagcaaa 420tgagctgaaa tgtcggaaac agagtccatc agcttatttg gccacacgat cccaaactag 480ttttatcttg ggaaatggcc ctgtcctcag cattcccttc ttgtgctggt ggggccagtg 540aagtcttgat cttatcagaa aaaggccaca ccaagtgcga gttttcccag gctgactttc 600caggccctta tcaaatgaaa caacagaagc tcttcacagt tctgtgcccc atggccactc 660cacagacaga caataccaag catcttagaa ctgtcataag ataggtcatg cctgaaatag 720atcttgacca tatgagagtc ccagaaatca gcaaggcctg gacaaataga actaagagag 780aggcagaggc aggaagctgc gggtctatct tgtaaagagt ttagcatcac tgtgagagtg 840tgtgtctaaa attaaattaa actagaagca gcaggtgagt atttggtaag tacttctgtg 900actcgcctca attcccactg gccaggggcc atctcaactg cacggtgaat caagatgctg 960gtgtcatcct ccttggaaaa aggaaatgtt aactcatggt taaaactaag tacaatgatt 1020cccaagggat cactttctta tttttttaaa tgacattaag gagaatctta agaaagcatc 1080agagaaagac atgtgcatgt gaagcaccct gattctgatg ttaggaaaac ttaagcgaac 1140aggacctgct gcacacagcc ccattgtctt ctatccattt ctctttatca ttcaaatcaa 1200gcaacatgtg ccctcctcat caacacacat tcttcccctt tgtcagtatg catctcccag 1260cttagtgtca ggatactttc gattcataat tatgtatgat ccaaagtgtg cataatttca 1320tttaacgtta aagaaataga tccaattcct ttcttgcaac caaaaataaa taaaatacgt 1380tgcctcaata taaggtttgg gctattctgt gtttctatag aagcaatctg tttttggtaa 1440aatgtacttt taaggatcca gtcatctgaa gtattttatg tagagttaga gatttcacaa 1500tattgactat acatatattt aaaatataaa ttatccagct gatgtttgaa tttgtcttac 1560tttcctggcc acctcgttgt cctattttat aagctgggga gttaactagc ttaacaaaag 1620atgcttagct tttgtaaaag aacaagtgtt tcattttaca aagacactcc aaatgatagt 1680tacttgattt tctcgagacc tttaactatg gtgatgaata acaggacttg ctttcaagcc 1740ttaataaatg taaaatgcct tttaatgaag atacagctga gtgttttcct catgaatctg 1800aaccaattac caatttgtgt tccagtcttg attggtattg actgattcaa ataaagttgg 1860tttattttca aatattaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1904 15 968 DNA Homosapiens misc_feature Incyte ID No 5449893CB1 15 gaatccaggg gaagggatggaagaggaaga aaaacaagct tggagcagtc caacccagct 60 agggccctcc attccctcagggacacccca cacccacccc acacactggg atgaaccctt 120 gcagaggaac aattcagatggtcacacatt ccaggaccca aatccgtaaa cacaaagcat 180 gtccgtcagt gccagcacctccccccggct aatcaagcag ctgtcccaga gggcaaaggg 240 tctctgcagc catctgctttcatcagggct gcagccccca ggcagcagta ctgggagccc 300 ctctcatctc cgagaataaactctgaagcc agcgaccctg cggacctgaa tcatcaggga 360 gcctgtcaga ggaggggcagtgactctgcg ggacaagcaa gcaggctata taagtttcag 420 aaggctgggc tccactcagatcttttccag cagctgctgc ctgccagaga ggcgccttca 480 gagacccagc gcttacacaatacccaccat gtcccaggct ggtgctcagg aagcccctat 540 caagaagaag cgcccccctgtgaaggagga ggacctgaag ggggcccgag gaaacctgac 600 caagaaccag gaaatcaagtccaagaccta ccaggtcatg cgagagtgtg agcaagctgg 660 ctcggccgcc ccgtcggtgttcagccgcac ccgcacaggt accgagactg tctttgagaa 720 gcccaaagcc ggacccaccaagagtgtctt cggctgagaa gtgtgcgcca ctccccttgc 780 tgcccgaatg ctcggaaacaggagccttac ccaggaactc ttttttatgc cagaacgctt 840 cctctcccct gctgtctctggggctgccac cctcccccac agtccaggcc cttcagccaa 900 gggctctgca ccagcaccttggaagcacca ataaagagga tgcccacgtg gccccagcaa 960 aaaaaaaa 968 16 1112 DNAHomo sapiens misc_feature Incyte ID No 2951269CT1 16 gaggcaagaattcggcacga agggtagacc tcacaggtgc ataaaatcat taataaagca 60 tgtagcacttgctaattggt gccttaagct tgaatctaat cagaattgca gactcgggtc 120 ctctgggaaaaaaacatgtc cgtctgtggc acgtgtgagt actaggccca ggggaagagt 180 ctgaaaattgaattcttttg tgtgtcctgt gtctcagaag agaactgaat gttcagagca 240 gcgtttgtaagctattaaca ttcagtattt cgtgttgcaa ctagaacaca ttattagatt 300 tattcctgtttaattcataa tggtgcagaa taaaacacac acatctgatt tgatttcttt 360 ttctttttttaagtttcata attgcttttt atggctagtg ttaatggcaa aaagtccttt 420 ccagggctccctgaataatc taccatacct gtatccatag caggtgatgc ttttttttat 480 ccccactttgaagacgtgtg tttctgtatt tacacataaa tcatactatt gtatattaaa 540 gacagcagtggttgaaaaga atgtgaacac tgtagaagtt atgttggaaa aaaggagagt 600 aaattgtgtgattaatgggg aaggatattg gataatgtta taccccggac tatgaaaaaa 660 gctggtggtaaatgggaaga atgtgaaatt ttaaactgct ctcaacgtag gaatcttggt 720 ggaaaagttcctacctgagg tctgatatga ttcaattata gaatgcaatg agcttggcca 780 aggggactttgaatccagcc aaggaaactt tgaatctcga cagctctgag aatcacattt 840 tcagtgcattgaatatggag taaactattt agacaaggat tctgtgagac taggctactt 900 acctttaattgccagcattt gtaaatgatt gtgcaatctt gtgtaatggt cttttatttt 960 gactgttttggaaaaaaaat gttttattgt ttttttttcc cagtaaaaat tacttcaaag 1020 aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaggcg gccgcaagct tattcccttt 1080 agtgagggttaattttagct tgcacttgcc gt 1112 17 1714 DNA Homo sapiens misc_featureIncyte ID No 282977CB1 17 ggaaagtgga agttggattc tgaaagatcg aggtgcccacaggaatttta tggtcgtcgg 60 attttgaaga cttgaactag actgggggtt ctccttgcatttcttgcctg ttgcctatct 120 ttgtcctctc tcttccggct tcgagatgaa tgtgcagccctgttctaggt gtgggtatgg 180 ggtttatcct gccgagaaga tcagctgtat agatcagatatggcataaag cctgttttca 240 ctgtgaagtt tgcaagatga tgctgtctgt taataactttgtgagtcacc agaaaaagcc 300 gtactgtcac gcccataacc ctaagaacaa cactttcaccagtgtctatc acactccatt 360 aaacctaaat gtgaggacat ttccagaggc catcagtgggatccatgacc aagaagatgg 420 tgaacagtgt aaatcagttt ttcattggga catgaaatccaaggataagg aaggtgcacc 480 taacaggcag ccactggcaa atgagagagc ctattggactggatatgggg aagggaatgc 540 ttggtgccca ggagctctgc cagaccccga aattgtaaggatggttgagg ctcgaaagtc 600 tcttggtgag gaatatacag aagactatga gcaacccaggggcaagggga gctttccagc 660 catgatcaca cctgcttatc aaagggccaa gaaagccaaccagctggcca gccaagtgga 720 gtataagaga gggcatgatg aacgcatctc caggttctccacggtggcgg atactcctga 780 gctgctacgg agcaaggctg gggcacagct tcaaagtgatgtgagataca cagaggacta 840 tgaacaacaa agagggaaag gcagtttccc tgcgatgatcacacccgcct atcagatagc 900 caaaagagcc aatgagctgg caagtgatgt gaggtaccatcaacaatatc aaaaagaaat 960 gaggggaatg gctggtccag ccattggagc tgagggcatcttgacaaggg aatgtgcaga 1020 ccaatatggc catggttacc cggaggagta ttaggagcacaggggacagg gcagcttccc 1080 agctatgatc actccagcat atcagaacgc caagaaagctcacgaactcg ctagtgacat 1140 aaatacaggc cggacttcaa taagatgaaa ggcactgcacattatcactc gcttccagct 1200 caagacaact tggttctcaa acgggctcag agcgtaaacaaactcgtgag tgagaataaa 1260 tataaagaaa actaccagaa ccacatgaga ggccgctatgaaggagttgg tatggacaga 1320 cgcactctgc atgctatgaa agttggcagc ctggcaagcaacgttgccta caaagctgat 1380 tataaacatg atattgtcga ctacaactac ccagccactctcacgccttc ctatcaaaca 1440 gctatgaaac tggtgccctt gaaagatgcc aattataggcagagcatcga caagttgaag 1500 tacagctcgg tgactgacac cccacagatt gttcaagccaaaatcaatgc ccagcagctg 1560 agtcatgtga attaccgtgc tgactatgag aaaaataagttgaattacac attgccccag 1620 gatgttcctc agctggtgaa ggccaaaacc aatgccaaactcttcagtga ggttaagtat 1680 aaagaaggct gggagaagac aaaggggaaa ggat 1714 18806 DNA Homo sapiens misc_feature Incyte ID No 3178454CB1 18 acttgtctcagtctggatca gactcaagtt gctctccaga atgcctctgg gaggaaggca 60 aagaagttatcccaactttc tttagtacca tgaacacaag ctttagtgac attgaacttc 120 tggaagacagtggcattccc acagaagcat tcttggcatc atgttgtgct gtggttccag 180 tattagacaaacttggccct acagtgtttg ctcctgttaa gatggatctt gttgaaaata 240 ttaagaaagtaaatcagaag tatataacca acaaagaaga gtttaccact ctccagaaga 300 tagtgctgcacgaagtggag gcggatgtag cccaggttag gaactcagcg actgaagccc 360 tcttgtggctgaagagaggt ctcaaatttt tgaagggatt tttgacagaa gtgaaaaatg 420 gggaaaaggatatccagaca gccctgaata acgcatatgg taaaacattg cggcaacacc 480 atggctgggtagttcgaggg gtttttgcgt tagctttaag ggcaactcca tcctatgaag 540 attttgtggccgcgttaacc gtaaaggaag gtgaccaccg gaaagaagct ttcagtattg 600 ggatgcagagggacctcagc ctttacctcc ctgccatgaa gaagcagatg gccatactgg 660 acgctttataagaggtccat gggctggaat ctgatgaggt tgtatgatgg ctgctgggca 720 gcacctcctaacttcaggga ataaagtgct aaagtgtaaa aaaaaataaa aataaaaata 780 aataaataaataaaattaaa aaaaat 806 19 555 DNA Homo sapiens misc_feature Incyte ID No3563859CT1 19 gccagacacc tgagccgact ggtagtaggg gcagccgtgt ggcggggagccggccgggcc 60 ttcctgctca tcgaggacct gactggctcc tgcttcgagc cactgccccagggtctgctg 120 ctccacgagc tgcctgaccg ccgcagctgc ctggcagccg gccaccagtggcgaggctac 180 accgtctcct cccacacctt cctgctcacc ttttgctgcc tgctcatggcagaggaagca 240 gctgtgttcg ccaagtacct ggcccatggg cttcctgccg gcgccccactgcgccttgtc 300 ttcctgctga acgtgctgct gctgggcctc tggaacttct tgctgctctgtaccgtcatc 360 tatttccacc agtacactca caaggtggtg ggcgccgcag tgggcacctttgcctggtac 420 ctcacctatg gcagctggta tcatcagccc tggtctccag ggagcccaggccatgggctc 480 ttcccccgtc cccactccag ccgcaagcat aactgaaaga aataaaaaccatcgggcctg 540 aaaaaaaaaa aaaaa 555 20 1159 DNA Homo sapiensmisc_feature Incyte ID No 985730CT1 20 taagatctac tcaaagtact tcaaacaaaaaataaataat tcattggctc atgtatcttg 60 gccacccagg gaaggtctga cattgttagttagatccaga gtttcaaatg tcatcaccat 120 ggatgtgtct ttttctctct ctcatttcccctccccatat cttgtctttt atttattata 180 ggtttgtctc attccctggc aggctctctccctgtgatag gaaagagagt ccccagcagc 240 cccagggtga catagttgtt atagttcattatggaataga agagagaaga gcattctcaa 300 taacccggca aagttcccag ggatgactctgatatgtcta tgtctcaggt cacatttcca 360 tctatgaacc aatcatattc agaggtggaatgctaattgg ccaggcctgg gtcatatata 420 caagtctagg gaagaaatga gcttcatccctgtccaattg acatggactg attaggggta 480 ttaatggaag aggtgtgcca ccacaaaagaatgtaccatg ggcagatcaa agaacatatt 540 ctgtatgtca ggcttggcac aaaagaatgacacaagtaat atgctgtaga tcagaacctc 600 tctgctaata ttgccttttt agcatggttaagatagctaa gatctagtac tgtcactcca 660 gtatgtccca attctaccta cgtttattgaagggtcaaca gttctgatct cagcattggg 720 taaagggtgg gacattcaga tttacggtccttgataaaaa caatttacaa cgttccgttg 780 tgtaataaat gtaagtgtac atatgcctgggacatcagct ggaaaaggga cagactatca 840 gagagttgca ctgttgcggt atgggccaaatccaacataa tacccgctgt acctctagag 900 aactaaaacc ttaatttctc agatcttttctgcactaatg gtctttacat acagcctaca 960 ttttaactaa ctcttgcatg ggcttgtttcacagcaggaa actatattca tcatatcctt 1020 attatgatag agaatgacaa cattcaaaagggtgtggtgc ttctgaaaat atacacaata 1080 aatggcatga tttgaaaaaa aaaaaaaaaaaaagatcggc gcaagcttat tccctttagt 1140 gagggttaat tttagttga 1159 21 878DNA Homo sapiens misc_feature Incyte ID No 3684987CT1 21 gtggcatccaccattaaggt taagtgtggt gtgccctgtg agtctgaatg tctacttaag 60 aaccttaagtagacattaag aaccttaaga aggttttttg tttgtttttg tttttttgtt 120 gttgagatggagccttgctc cgttgcccag gctggagagc agtggcgcaa tctcagctca 180 ctgcaacctctgcctcccag gttcaagcaa ttctcctgtc tcagcctccc gagtagctgg 240 gactgcaggcgcctgccccc aagcccggct aatttttgtg tttttagtag aaatggggtt 300 tcaccttgttggtcaggctt gtctcaaact cctgacctca ggtgatccac ccacctcggc 360 ctcccaaagtgctgggatta caggcatgag ccaccatgcc tagcccacaa actcttacca 420 ttcttaaatgtatttatttc agttcctctt ccactactat attataacct accctggcag 480 tccttctcatctgctgcaat atttcccatt ccttaagatc taacctatgc tgctccttct 540 ccatgaggctttttctcatt aattcatgca cactgatctc tcccttctct gcattcctgt 600 catacatcattatttcataa ttattttgca tgtgttgtac tttttctttt cagccacatt 660 cataagtctctggggaaaga aattaggctt tcatgatttt gtatccttat cctacacccg 720 gcaaagtgctgagtatacag taaattctca aaggctttat gtcttcttca atcgaaaaat 780 ttacacttgaagaaatttgt cttgtagcct atgaagtcaa acagtaccat taggaaacaa 840 taatcaagactccatgacct aaccatgtta tattatta 878 22 667 DNA Homo sapiens misc_featureIncyte ID No 986166CT1 22 gcgttcagga gacagtcacg gtactcgttt ccagacagaagtcatgagga acaagaggga 60 aggtgctttc ccgtgtgcag cgcttgggga gactcacacagacagaggat ctggcatgac 120 agggaaagga ggaaatggct tctgttaatc tctccttcagcttctcccgc ccttcccatg 180 cactcttcct gtttcccttt ccagttctca cggtgactcaaggaacaacg tgtgaaatga 240 aagacctcag gtgctgtatt ggctcttgac agctcttcagaagaaaatac ctcctgcctg 300 ttctgttcag tcctggtgca gcttccagga agccaaatgacccaccggct tacccacatc 360 gcaggaagct ttggagcaga gtcagtgact atgtgaacctgcctcaacct ctgctccctg 420 gttcagcatt tggcttggga aaaatgacac tatttcctgtctcttaaaca ttatttcaag 480 gcacaggtct tccaccattc tgagaggcag ggggatctttgagttctgcc aggagctggg 540 ggttaggggt aggggaatcc cgcccaaggg aaatgactagaatctttgtc aggctgtgga 600 acacaggcat tctggatagg tggctcccct gtggctctccctggaatcta catgcaaatc 660 cctgtat 667 23 1421 DNA Homo sapiensmisc_feature Incyte ID No 1887508CT1 23 tgatcagtga tatcaaacat caggaatcagcctttatgta acataacagc tgtcctccta 60 tggtgaaagg ttcaaatgta gtgaaggtataacctatatt gactgagatt tcccttttag 120 gtagtgcctt atctctatta ctagtgttaaaggaataagg aatctatgaa ggacagggag 180 cagctctggt ctgtcaatct cagccacctgtttgatatca cagagaagat actcggagga 240 ttgttggaat gtatatagtt tagtaagaagtgggtaagaa agagggtctt aattactgag 300 cacttattat gtattaggtt ctttgccagatgtttttaca tatataaact catttcagaa 360 aacttattta aagtaaatgg ggccgggtatggtggttcat gcctggaatc ctagcacttt 420 gggaggctga ggtaggagga ctgcttgaggccgggagttg gagaccagcc tgagcaacat 480 agtgagaccc tgtctcaata ataataataataataataat agtaataatg aagtaaatgg 540 gataaggaaa gaaggataat tatctttaaaggttgattcc caccctccct ccccagttac 600 ttaaggaact aagtgagtac atctccagttgcccatgaaa gcataagttt gttttcctca 660 gctgaggcaa gtggtagagt atacaggataacgaagtaac atgtaaaagg caggacgcac 720 ataaaggtgt acatggctat tgtttcacctggagaaacca catgattggg acctgaaggt 780 ttactgactg actacagggg ctgattgtgaagcacgagga accccatgtg tgtggagact 840 gtagggtgag agcacacaat tattagcatcatttctgagt gatctcacag attttttttc 900 ttgtgtttgt tttgcttttt gacaactgcttctcccacgt tccttgcaat tctattctct 960 caccttcact ttactatttg tattcgatggaccaggataa ttcaggcaag gttaccttgt 1020 aaacttgaat tggccacaca ccatgttgtcacccagctgg ctatgaagtg aataatggta 1080 ctgaaagtaa acctgaagac ctttctcagatctattttaa gtctgagtct gaccaaccat 1140 ggaaaatatt cgacatgaat taatgtagagaactataaag catttatgac agctccaaga 1200 aaagtcatct actctatgca ggagatatgtttagagacct ctcagaaaaa cttgcctggt 1260 ttgagggtac acagtaccat tttaatcttctgaaaatatc tgtattcctg ctctttttct 1320 gctgtcactg tcaatctgct atatttttcactatcctatt aaaatattac tgtctcctta 1380 aaaaaaaaaa aaaagggcgg ccgttcgcgatctagaacta g 1421 24 2630 DNA Homo sapiens misc_feature Incyte ID No1006416CT1 24 aataaaggag ctccaaatgt cgttgggtgg ggaagcaaaa tgtagagaaacatttaaagc 60 acactgtaat aataaatgca attataaact atatggagga gggtgcagaggagggaatgt 120 gtctggtgtg tgatgtgtgt gtgtgcagtg ggggtatcac agagagtatgacatctgagt 180 tgagggtagc aggtgcctgg agtctcaggt ggctgctcac ccatctgtgcaggtgtctct 240 ggggctgctg gtctcacctg tggtctgcag tagacacaat tggctgagcaggatatgtga 300 tactgtgtgg ttggtgtgga gttttgaaga aggggctgtg tttgggccacgtaggctcta 360 ctcagagacc tgaaaccact tcagaatggt gcatatgtcg aaagagctggctgggggcct 420 tgcccaaacc aactgaggtc ttaaagtccg gggaaaaaaa gtctgggttccaactagaat 480 tctagaaata tttctagaac acacagagag ggaataagtc cctctatcacccttattacc 540 aagccttgtg gttccctgtg attttagata atgtctgata tttttctggctatttgccta 600 gtaggattta aaaaatattt tcaaagtgaa gctgagagag aatcttggaaacacacatac 660 ctgttgatca tgggccctgc agaattggcc cttgggggct ttatttggttacatgtgcct 720 gggtggtctt taccagctta gactctatca tgggccccca tgaagctccattctcaatac 780 tgaataatta ttacttccct tgttgagttt ctttttctgt catgccctgggggcttctgc 840 tcttctcacc agaaagaaca tttgaatctg gattcttgta cacctgggttagaccctgtt 900 cagaggtgtg gccaatttat cccgatctcc tggaaggctg ttgtgatttccatctaagaa 960 atgagggtct tgagaatcaa ccagtcccaa gattagcctg ttatcctgttatctactgag 1020 acctcaaatt tctcaccaat gttttgggag atcctggaaa agatcccttcagtttggggt 1080 gtcaccaaga cttctacaca acccaggact accattgacc tcagagctgtaccccacatc 1140 ttgaagtaaa ttgatcccac caggtcccac gtttgttatc tctgcctaaatgttagcttc 1200 tccatcctca ccacatgatg acctgctgtg tccctctgag cactacccagtggctgaaaa 1260 ctctgcaaat gggccacact tttgcaaaat acttgtatct gacacttaggtcttgtttga 1320 agaatttcct ttctggaagg ttttacaaga agactgatag tctttcaagcccccacatca 1380 caggcttagg gacggcacta actttctccc agggatctaa ctggctagttcaaattatca 1440 ctcttttacc ttcatataaa atgtctcccc caaacctttt tcccttctttgtcattgtta 1500 tctgctaagc cactggtcat ttccccatat tcgtagtctt tttttccatcctatctttct 1560 aatatttgtt gtctttaaca aactgtgttc tgtgtctgtg ctcctccttccctctcagac 1620 cactggaatg caagtccttc ttccctttgg aatgtactct ggatcccttcccctgctttg 1680 acccccagac tttgctccat ctattattgc ttctccatcc tggatccttgacatttgtca 1740 ccccactggc cttctcaggt gcaatcagta aaaatgctga gaactcttggatcttaatct 1800 tcatgactga gtttttttta gttgtatagt tatcatctgc ctttcttcactttgcatttc 1860 ttcttgaatc cattgcagat tgacttccac tcccactcct tcactaaaagggctcttacc 1920 aagatcaaat ctaatgggta cattttagtt cctatgtgat ttggcctttcgatgtcaatc 1980 atcactccca gccattgatt ttggtgaccc acttccctgt gatgatcttctgatctagtt 2040 tctcaggttc cttcgctggt cctttttctt tccctgcccc tgacatattgacatttcctg 2100 gagttggttt tgtccttgat tcattctcat gtcattctgc acacagtctctgcatgaact 2160 caggcagacc cttcatttaa tgaccacctt agggctgatg attctcaaatctgtattccc 2220 cgatcttgca tttgagctcc agccccactc atcctctcgg atgttctgcaggcccagcaa 2280 actcatcatg tccaaagtga aactttttct ctttcctgtc tcctctcctctgatctgttc 2340 tttcttggaa caccacccaa gaacgtcacc tcctccatca gattgtgagctcctggaggg 2400 caggagctgt gtccttctat tcatcttcct atccccagaa ccttgcacagatcctggaat 2460 gtggtaggtg ctcagtaaat gtgtgttgaa taaatgaatg aatgaatgaacaaatgaatg 2520 aatttgctta cttcaaggca aaagaaccat gaaactgtat tttgagtttctatgttatag 2580 cagtcagcaa atcctattaa atactttgtg tttccaaaaa aaaaaaaaaa2630 25 1039 DNA Homo sapiens misc_feature Incyte ID No 975169CT1 25gttgacacgt tgtatgccat cctggatgag aagaaaagtg agttgctgca gcggatcacg 60caggagcagg agaaaaagct tagcttcatc gaggccctca tccagcagta ccaggagcag 120ctggacaagt ccacaaagct ggtggaaact gccatccagt ccctggacga gcctggggga 180gccaccttcc tcttgactgc caagcaactc atcaaaagca ttgtggaagc ttccaagggc 240tgccagctgg ggaagacaga gcagggcttt gagaacatgg acttctttac tttggattta 300gagcacatag cagacgccct gagagccatt gactttggga cagatgagga agaggaagaa 360ttcattgaag aagaagatca ggaagaggaa gagtccacag aagggaagga agaaggacac 420cagtaaggag ctggatgaat gagaggcccc cagatgcaga gagactggag agggtgggga 480ggggcccagc ggcccttggt gacaggccca gggtgggagg ggtcggggcc cctggagggg 540caatggggag gtgatgtctt ctctctgctc agagagcagg gactagggta ggaccctcac 600cgctgcgtcc agcagacact gaaccagaat tggaaacgtg cttgaaacaa tcacacagga 660cacttttcta cattggtgca aaatggaata ttttgtacat ttttaaaatg tgatttttgt 720atatacttgt atatgtatgc caatttggtg ctttttgtaa aggaactttt gtataataat 780gcctggtcat tgggtgacct gcgattgtca gaaagagggg aaggaagcca ggttgataca 840gctgcccact tcctttcctg agcaggagga tggggtagca ctcacaggga cgatgtgctg 900tatttcagtg tctatcccag acatacgggg tggtaactga gtttgtgtta tatgttgttt 960taataaatgc acaatgctct cttcctgttc ttcaaaggaa aaaaaaaaaa acaaaaggga 1020aaaaagggag agaaaagag 1039 26 1057 DNA Homo sapiens misc_feature IncyteID No 4152861CB1 26 ggagtcgggt tacaccactt gtgtctgagt tcacgcagcatgttcctctg tcagggattc 60 cgcaaatatc tccctgaggt aaaaaaggaa agtgtgctgcgctccagcac ccagagcagt 120 gagcccagtc ccgagtcccg gagagagctc cagcaataggggccatgtcg ccatagcccc 180 agcctctcgg tccgcagcct cagcagcgtc ccagccggctggcttcatgc tgcggtgcag 240 ctgcaccatg ttcctgggtt gagggggcaa tcgggcacgctcctccccat gggttgccca 300 tcatgtctaa tggatatcgc actctgtccc agcacctcaatgacctgaag aaggagaact 360 tcagcctcaa gctgcgcatc tacttcctgg aggagcgcatgcaacagaag tatgaggcca 420 gccgggagga catctacaag cggaacactg agctgaaggttgaagtggag agcttgaaac 480 gagaactcca ggacaagaaa cagcatctgg ataaaacatgggctgatgtg gagaatctca 540 acagtcagaa tgaagctgag ctccgacgcc agtttgaggagcgacagcag gagacggagc 600 atgtttatga gctcttggag aataagatgc agcttctgcaggaggaatcc aggctagcaa 660 agaatgaagc tgcgcggatg gcagctctgg tggaagcagagaaggagtgt aacctggagc 720 tctcagagaa actgaaggga gtcaccaaaa actgggaagatgtaccagga gaccaggtca 780 agcccgacca atacactgag gccctggccc agagggacaagatctaaaaa aaataatgct 840 gggaagtcct aaccacatca agaatgcctc agatcagtgacccaaggaac cttccagaat 900 ggatgaaata gacccaaagc tgaattcacc taattttagggccaaaaacc caaaaaacaa 960 aacaagacca aaaaaatctt cagatactgg gagaacaaatctcaattgct caattgtatc 1020 ttatgaaaac aatttttcaa aataaaacaa gagatat 105727 1363 DNA Homo sapiens misc_feature Incyte ID No 986464CT1 27gaaatcacac agaggccaga ggtcacacag cctcaactgc cccttccacc aggaggcagg 60agacatcaag agagtatttg tgccctcctc gggttttacc ttccagccga gattctccct 120cctccccaac atttatctcc atccagtcgg ccacaaggaa gcctctagag actcccagct 180ttaagggcaa ccctgatgtc tcagtgaaaa gcacacaact ggctcaggac ataggccagg 240ccctgctcca ccagaaaggt gtccaagaca aaactgggaa gaaggacatc acccagtgct 300ctgtgcaacc tgaacctgcc cctccctcag ccagtcccct gcccagaggg tggcaaaaga 360gtgttctgga gctacagacg gggccaggga gctcacaaca ctatggagcc atgagaaccg 420tgactgaaca gtatgaggag gtggaccagt ttgggaacac agtcctcatg tcttccacca 480cagtcaccga gcaggcagag ccacccagga acccaggctc ccacctcggg ctccacgcct 540cccccttgct gaggcagttc ctgcacagcc cagctgggtt cagcagtgac ctgacagaag 600ctgagacggt gcaggtgtcc tgcagctact cccagccagc tgcccagtga ggcccaccgc 660ctcccaccac acctgccacc tgttcctggc ctccactgcc ccaggactga agtgggtacc 720tgcctcctgt acactggagc aaggaccaag aggaaatggc atcttcagag gattactgtg 780ggccatttcc ctttcgcagt tctttcaata ggcccagttc ttccaaatgg aaaaagaaag 840gtctggaaga ggcccacaga gttgcacagg cgtgggggta ggatgggggc tcccagctgc 900ttgtggagga tgtaatatat acagacacac acatgttttt cacacaggcc tggcccacgc 960atcgacatgt gtgaatttgc acaccactgc ctgaattgga gccccccaga gtgtccctct 1020acccagagtt tttatttctt taattagtct gagtgttccc agccatctgc tccttaatcc 1080ctggagagga acagagccaa ctggacacag cgttggtctc tgtttggaat cactgtgagg 1140tctccagaag gacctggccg ccagcccctt catcaccatc tccatcattc agctggtcat 1200ctggtggccc aaaggtcacc caaagagtca gcaatcagca tgtccctaga agccaaatgc 1260actgcctttc tctgtcccca tgactgtccc ccactctgca ccccaaatgg gaagcatacg 1320gtctgaataa atccaagttt tattctctaa aaaaaaaaaa aaa 1363 28 1513 DNA Homosapiens misc_feature Incyte ID No 118472CT1 28 cttcaacatg cccctcactatctcccggat cacaccaggc agcaaggcag cccagtccca 60 gctcagccag ggtgacctcgtggtggccat tgacggcgtc aacacagaca ccatgaccca 120 cctggaagcc cagaacaagatcaagtctgc cagctacaac ttgagcctca ccctgcagaa 180 atcaaagcgt cccattcccatctccacgac agcacctcca gtccagaccc ctctgccggt 240 gatccctcac cagaaggtggtagtcaactc tccagccaac gccgactacc aggaacgctt 300 caaccccagt gccctgaaggactcggccct gtccacccac aagcccatcg aggtgaaggg 360 gctgggcggc aaggccaccatcatccatgc gcagtacaac acgcccatca gcatgtattc 420 ccaggatgcc atcatggatgccatcgctgg gcaggcccaa gcccaaggca gtgacttcag 480 tgggagcctc cctattaaggaccttgccgt agacagcgcc tctcccgtct accaggctgt 540 gattaagagc cagaacaagccagaagatga ggctgacgag tgggcacgcc gttcctccaa 600 cctgcagtct cgctccttccgcatcctggc ccagatgacg gggacagaat tcatgcaaga 660 ccctgatgaa gaagctctgcgaaggtcaag ggaaaggttt gaaacggaac gtaacagccc 720 acgttttgcc aaattgcgcaactggcacca tggcctttca gcccaaatcc ttaatgttaa 780 aagctaaaag gctgcctggaatccccccac cccaacaggc tggactccct ccatccttac 840 ccccacacag atctggcatgtgagccccac ggtgatgctt gacaatgtat aactctgctg 900 ggggcacctc tgatggccaaccgcagcatt tctgtcctct gcccacccca gagctgatgc 960 tggggcccag ccccctgcagctctgtaccc accaaacctc cccagggcaa ccctcgccac 1020 cccccaaata gcccgtagcccaatcccctg ccctctgcac agggccttag ctgtagacca 1080 gagagggcag gaggggtttgctggcataac accccagaac caagggaaat ggatgggccg 1140 ctgctcagtt tcccaccatcctcagctcct ggcctcatcc cctcctagaa tgagtcaccc 1200 gtagatcagg gtctggggaagaggctgatc cctggcgctg cccggctccc tcgctgccct 1260 ctggagctca gggcagcccggaatagggct ctttgaagag gaagtagaag ccccagggta 1320 atgaggcaga gacccctcctggcagtggtg aggtgggggc atgcaccctc ctttctgtac 1380 cgtgtgtgct ggctccatagttctctcttc tgtacatata agcatgcttg ttctgaaata 1440 aagaagattt gaagtgaaccacaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaataa 1500 aaaaaaaaaa aaa 1513 29627 DNA Homo sapiens misc_feature Incyte ID No 1314633CT1 29 gcctgtgtgagcctgaagac ctttctgtcc tggcgacccc tcagaagggc tgcactggat 60 cttgtctgcccggggagcgc acctatccat tggagggaag agcctcctgc gggtagagga 120 tggccagctactcagcaaac tggacttgag ggggcctggg cagctggagc cctgctctga 180 ggaagaagcacattccctga agcgtctgga agatcagagc cctgggccac caagggggtg 240 gcctgcaggaagagcccctt cacggagaaa ccttgctcag aatccctgcg ggtgccagtg 300 gagccgcttttcgcctttgg ggcattctgg actcagcttg ggctgctgct cccgacccct 360 acccccagccccatgcccgc gcttcccctg ctgtgtgtag tgggagatct ctctgtgcct 420 ggcagcccctgcagaccctg ggagggagct caggctgagc caggcactgc aggggatctg 480 ggaaagccaagatgggcaag gaaacccttc tatggccagg agtggtggct catgcctgta 540 atcccaacactgtgagaggc caaggcagaa ggatcagctt gaggtcagga gttcaagacc 600 aaggggggcagcatcgtgaa gagaagg 627 30 1606 DNA Homo sapiens misc_feature Incyte IDNo 1997439CT1 30 ctcgtagact cgcattgact taattttttt aaatcttatt gcatattttgactagataat 60 aaatgcatat ggttaaaaaa ttcacatggt tcaaaaaagt acacctcccactcatcttcc 120 atgtgatatt tcctttctgc ttagcaattc tgtatttatc ttgctaaacatgaatgacag 180 ttgtttgctg aaattacatt aaatgtgacg taataaaatc attgtaagtatacatttttt 240 aactttaata atttttaatg tcttaatgaa gagtatgaag agtagtagtactgctcttca 300 aagtactact actttacctt accttttact gttttgttaa gaaaattaggccgggcgcag 360 tggctcacgc cggtaatccc agcactttgg gaggccgagg cgggcggatcacgaggtcag 420 gagatcgaga ccatcctggc taacacggtg aaaccccatt tccactaaaaatacaaaaaa 480 ttagctgggc gtggtggcga gcgcctgtag tcccagctac tcgggaggctgaggcaggag 540 aatggcatga acctggaagg cggagcttgc agtgagctga gattgcgccactgcactcca 600 gcctgggcga cggagcgaga ctctgtctca aaacaaacaa acaaacaaaagacccaatct 660 gagtcttatc gttgtactga tagaagggtc agatatcccc acatggagttgagtgggaga 720 aagagattca ctagagaata actccttaga gaccaatgtc tgtagcaggtgtacagcatc 780 ttgtgaaagt tatggagcat gaaaagactg aagggccagg acagtttgcatgggctgagt 840 tataccagct agaccaggaa tagaacaaag aattctatac ctcaggatttcaaaaagtta 900 gcaacttgag aggccagtgc tgagcaaccc agtacccagg aaatgaaaaaaaaaaagaaa 960 attccctccg agaatgaaca aatcattggc ttcattgcct catgagcttgagagaaagga 1020 gaagagagcc agagtgtggc aagtgaggcc aaaatcagaa gcatggcagaaatgagtgta 1080 agtgattgag ccacagacag aagtgtggcg agggacaatg ccatattgggagaaggtaaa 1140 gttgagtaac aagaaaccaa ccgtgtgtga gagggggatt ggaaaaaaatttgagggaga 1200 agaatgttag aatggaaggg aatgatggtg gaagggaggt gtgagggtgtgtgctgagtg 1260 ttgaaagaac ggttggtgtc tgtgtgattt tccttgagtc tgttcttcagtgtgtcttct 1320 gcagcttgcc atgactgcct gggaaagagt agggaaatac ccagagccaaaacctccttt 1380 cagtcccacc ccatccctca aaaccccagc tattgcttct tttcagcttcaggtcctgat 1440 ctccaatctt agtatggact cccttctcac caagaccacc accagctacgtttgctgtgt 1500 aatctggaaa gtgataattt cctttgcttg ttgggtgtga gtcacaatactttggtttgt 1560 gcacaagaat aaatttatgc cccatacctt caaaaaaaaa aaaaaa 160631 2184 DNA Homo sapiens misc_feature Incyte ID No 2638878CT1 31gccaaatgga ttgagtgatg agcagacatg tttaagggtc taagtctcaa gaatctgtta 60tgtgtgtttg ctgcggtggg agggggtgct tgtatttatc ttatttccag tcactataag 120gttgtacaca aactaattta aagtttactt aataatggta tctttaaaat aattgacaca 180attgcaaaat gaattcctgg cttcagttag ctattatttt tttaatgaca acatagactg 240tgctctaagt ttaaaagatg gggaagctta tataaaagtg acccttttgc atcatatggg 300tatctaaact taatttaccc aataagttga tgcttaatga ttttatttta tttttgtcta 360tttctatttt agttgtggct ttgctctaag aatgggtaat agttgtacta cagactgcta 420taaatttctt gtgatactct tttagagctc aaaatatctc tgagctttag acatggtaag 480gtggagagta aatgcttgat aaatctttaa gatatgtctt gaatgataat taggacattc 540agtccagtgg aaatacacca ttcaattagt caggtctggt gaatcgtttg tttaaaatat 600tagcaaatga gatgtggaat tctgaaattt ctccagactg tgtcttaata aaaatgtcac 660ctgggtgaaa ttttagatca atcactaaat ttgggtgaca aatataaaaa tattttcatt 720tcactttaat acattctttc tgtgaagtaa aatgtttttc tttctcataa tggcaaaata 780tgaatgccat caaagtttaa ggaattcatt ttagccttaa atgccttcgt gagatgtctt 840acttgtattt taggtaactg gtcatcagtg ccaatgacat ggataacaat ttttaatcta 900ctcgacagtg catccctggg aatgactgtt atgtttttgt catattcctg gtaatataaa 960tactcgtgtt ctttactaca ttgtttttat caactctaaa agtcatgcct ctgtgacctt 1020tatcatgttt acaattgcaa ctgaacttat gacaaattaa ctcaggaaat aaattgagtt 1080atcctttcta gcattgtaat taccatcagc aaggcctgag atagccagag ccaatactag 1140ccaagtgatt tattttcaag gattgccact aactacggtt ctttaggacc aagatataaa 1200acagtcacta aaaatcatta ggctaggtat cagtaataca ttcattacta ataatgcatt 1260tttggagact tttgtgaaag aagttggtct ctgccaaaag ctggtggacc acattcacac 1320cacgaaagcc agtgtcacat gaaccagatt aatgactctc tttatggggt atgtgggaca 1380tcctggaagt gtataatttc aggaatgacc agacaatacc atcttgcaaa gccccttcag 1440gtgacaatct aaacttgtgg gtaggagagt gcataaagtt tattgctcaa ctgctcctcc 1500agcctgctga atttactgag taaagaaata gcaaatatga tagatgtttt agatttcata 1560gaacagaatg gtttgtccat taattctttc attcaatgac tgtttattga atacctactc 1620ttttagggcg ctgtgttagg tgctgtattg tacaagaaaa atataataaa ttagattccc 1680agcgctattc tgacatagtg aatgaccttg aaaaatttac taaacatact atgtttgttt 1740ctccatgagt aaaataggga tatagggaca aacagtctaa tatctcatag aaataccatg 1800gagacaaata aaatatttta atataaatat gatattaaag taaatttctg aagtaatact 1860tttgggtatg gcactagttt ttcctctgac tattttactg tttctttcac tctcaatata 1920aaaactattt gataagataa aacgatatat tttattgtaa ttagaattta gacaaatcag 1980ctataatgta aaaatgttaa taataattac gttttatctg attaaagtta caatgatcat 2040agcactttaa aaatattatc tgaactgtca tttgtttata tattaccgtc taataaaata 2100gttatagatc ttccaagttt gatgccttac attttaaaag gaaaagataa atggttgatt 2160aagaaaaaaa aaaaaaaaaa aaaa 2184 32 1833 DNA Homo sapiens misc_featureIncyte ID No 3795510CT1 32 cgggcagtgc aagctaaaat taaccctcac taaagggaataagcttgggc cgccattttt 60 tttttttttt tttttttttg ctctttagaa gaggttatatttttattatc cttattttgg 120 agaacttttc cttataaaat tttttttcca gattccttatgaactcaagt tagtgttaaa 180 gctttggatt ccactgttaa cagtttatgt aaaaacacttaacaaattgc catttatatg 240 ccaaactata gctcaagaac actctgtttt agaaaaattacgcattagat caggaagcct 300 catatatatg tgcctctggg acttcatttg cagtcacatttagccagaaa agcaatgact 360 tctatattcc ttatggaaac caatgtaaca taaattaatgttctaaatat agaaattaag 420 agttcataaa gagactgagg ttgcatgtaa aagagttatggtttgagaca gtctaaaaat 480 actatgttaa tttcaaggat cttatttcca atgttttgtttaaaaaatta taaatacttt 540 tgagctcttg ctttgcattt caatcgcaaa cccactcagatacgggaact gtttaaattc 600 atatatggac aaataggttt cagtgatgca atactttaaaattctgccat ctccttgtgt 660 ttttctttct aggtgagtgg actgccagct cctgatgtgtcatggtatct aaatggaaga 720 acagttcaat cagatgattt gcacaaaatg atagtgtctgagaagggtct tcattcactc 780 atctttgaag tagtcagagc ttcagatgca ggggcttatgcatgtgttgc caagaataga 840 gcaggagaag ccaccttcac tgtgcagctg gatgtccttgcaaaagaaca taaaagagca 900 ccaatgttta tctacaaacc acagagcaaa aaagttttagagggagattc agtgaaacta 960 gaatgccaga tctcggctat acctccacca aagcttttctggaaaagaaa taatgaaatg 1020 gtacaattca acactgaccg aataagctta tatcaagataacactggaag agttacttta 1080 ctgataaaag atgtaaacaa gaaagatgct gggtggtatactgtgtcagc agttaatgaa 1140 gctggagtga ctacatgtaa cacaagatta gacgttacggcacgtccaaa ccaaactctt 1200 ccagctccta agcagttacg ggttcgacca acattcagcaaatatttagc acttaatggg 1260 aaaggtttga atgtaaaaca agcttttaac ccagaaggagaatttcagcg tttggcagct 1320 caatctggac tctatgaaag tgaagaactt taataactttaccaacattg gaaaacagcc 1380 aactacacca ttagtaatat atttgattac atttttttgaaattaatcca tagctgtatt 1440 aacagattat ggttttaatt aggtaatata gttaatatatatttataata ttatttatcc 1500 tttgactctt gcacattcta tgtacccctc cgatttgtgaagcctacagg aaatctgggt 1560 atatggattt gtaactgcag aagactatct taaaatacaggattttaaca tttaagtcat 1620 gcacatttaa caattacagg ttataaatta gtatcaactttttaaacaca tctaatgctt 1680 gtaataacgt ttactggtac tgctttctaa atactgttttacccgttttc tcttgtagga 1740 atactaacat ggtatagatt atctgagtgt tccacagttgtatgtcaaaa gaaaataaaa 1800 ttcaaatatt taaaacggaa aaaaaaaaaa aaa 1833 331859 DNA Homo sapiens misc_feature Incyte ID No 1413537CT1 33 cttctctttcctgagcctct ttagagcagc acttacagga ttgcctctgt aaagccttat 60 tcctgtcccagaaaaggtaa tccaaaaagt ctctagtatc cactaaaagg taacccaaaa 120 atctctagtatccactggct ttctccagtg tggaagcttt cccctccacc tcccatagat 180 cactggaaaggacccgaggc ctcggttcta atccctggct tatcactaac tgctgtgtgg 240 ctttggcttgtcccttagtc tctgtgagac tgctgcaccc tcatctgtca aagatggaac 300 tggacttagttgagctctga ggtccctgtg gacttggccc ctccacaccc tcattatggc 360 aactggacataaacttaaca gagggcttcc cagcaaaatg tcctcttctt cctacaaaca 420 ggctgtttctatatgtgcat gtttcatgct aagcacttct ttcttgggtg gagatggcaa 480 aggcctctttctgctgagac aaagtgattt ggagagtcac ctggcccctg aagggggagt 540 ggtaggatccagccacccag tgtgcagtga attggagcag ggatctcagc acacagggag 600 gtggggaggctccccctaac ctcgggcacc tgttgctcct ccagactgca gcgcatgctc 660 ttagctcatcctcttaactg gctctcaccg tgctcctggc tttggtcacc acgtagctct 720 cactccagcttcaggtagcc atcagtagga cctggcaata tacactgatt tggtttgttt 780 tatgtttgtctgcaggtgaa atccctaagg gctctgccgt gtactccagc cttgtgaccc 840 ttgccttccaggaaccatgc aagaagcgca gccaccagaa gtccttaaaa cagcaggaaa 900 ggtgagcctgtccccctttt gtgcagctac ctatctgctg aggagcatct gggcctcatt 960 cctccaagtccactggaggg tccagaagag ggagtcagag atgtatcctg gtggagctgg 1020 gagaaaggcagaaagccttt gtgacagcta tggaataccg ttagccaagg tccacttggc 1080 ccagcactaagcaaaagatg cgtagtttgc acagaaggtt ttgtgatact gcctctcaac 1140 agccccagcagcttgggaac tagcaagagc acatttcttg cctcatcagc tgtcctgaga 1200 tggaaaactcagtggatata ggaccctgat tccgatgaaa ggggcacgtg gtcccaatgc 1260 tggagctcctctggcaggtt ctaaaagcac actacggagc agcggtgccc tgccggacac 1320 tgctggcgggggctcagtga gcactactca cagatccaca cctgaccctg ttgggtcgag 1380 tcaggctgggctttggtctg cactgtagca cctgtgttct ttgagttcac atcatgaatg 1440 tggtgatttcccagatacca tctcaggctt aacctagcac atcctatttc ttttcttcta 1500 tgatatccaaattggactga cctcacttca aagttgctgt cccattttgt caccctatct 1560 tatctcggggaaattgcaga ctgatggcca gaccaactct gttgaaattc ttgcatagag 1620 caaacctgtgctcattttta agtggcatgg gagaggcccc aagcctagta aagcctagtc 1680 tgtgtcttcacagtgctggt agaatgtgtt tgtgtgtata aatatatgat atagatttat 1740 atatgttgctaacgccacat attgaaggcc aacataactg gtggacaggg tgggtgacag 1800 aaaatgaaagtctttttggt gattgtttaa gcaagatgtg tataaagaaa taaatagtt 1859 34 2125 DNAHomo sapiens misc_feature Incyte ID No 1623157CT1 34 tgtgtaaacaataacaagaa gacatgaagg atttatttgg ttatcaactg cccatggagg 60 aggctcttgatgatcccagg tctcctcgac ctccatacac cacacaggca tttgtaagca 120 cagtttccacaagcaccttg taggaatatg gataagatta gaccagcccc tctctgtcca 180 ctgggtttatttcttgaaga agatgcagat ctggtttttc caatgtgcca cagtctttcc 240 ttatcctctccatgctgagc ttgacaacac tctgggaatg aggaacaaga ctttttctaa 300 aaagatagtggaagttcaag ggatgtacct cgttttcagg ttcatccatc tccagtggaa 360 tgttttcaataaaagatgaa gaaaatgtgt gtgatcttta ataacacatc cctatagaaa 420 gtggataaaagatataccaa aactgtaata cagatatata caaatatagg tgcctttttg 480 attactcttgtttgtctagt atggtcttgg aaagaaaacc aagcaagcaa gttgctgcct 540 attctatagtaatattttat tacacatgat tgatattttt gtggtaggga agtgggatgc 600 tcctcagatattaaaggtgt tagctgattg tattttatct ctaaagattt agaactttag 660 aaaatgccgacttcttccat ctatttctga aaggttcttt gtggatttat atagagttga 720 gctatataaacattaacttt agatttggga tttaaaatgc ctattgtaag atagaataat 780 tgtgaggctggattcactac acaagatgaa cttcacttca taaattaatt ataccttagc 840 gatttgcttctgataatcta aaagtggcta gattgtggtt gttttggtta aggtgatatg 900 gaggtgggagagcttttagt taagtaagaa gctatgtaaa ctgacaagga tgctaaaata 960 aaagtctctgaagtattcca tgccttttgg accctttcct cgcaactaac tgtcaactgt 1020 tgatcaaaaaagtcaaggca ttgtatgttg cttctgtggt tattattctg tgatgcttag 1080 actacttgaacccataaact tggaagaatc tttgagcaaa ttttctcagt tgtctgtatg 1140 acttcagtatattcctggga atgccatagg attttttgtg cttgatacat ggtatccagt 1200 ttgcatagtatcacttcttt gtaatccagt tgctgttaag aatgatgtac tttaaaggaa 1260 aagagaaaactgcatcacag tcccattctc cagtgtccat gcaatgaatt gctgagcatt 1320 taggaagcagcaccaagtct attacaggca tggtgtgaaa cttgatgttt gacctgtgat 1380 caaaattgaaccattgtaca gtttggcttc tgtttgcttc aaaatatgta gaattgtggt 1440 tgatgattaatttgcgagac taactttgag agtgtaacag ttttgaagaa aacattgaat 1500 gttttgcaaatgaaggggct tcacggaatg ttacaatgtt actaatataa tttggctttt 1560 gttatgcaaattgttaacac cagctattaa aatatatttt agtagaaatg ctttaattca 1620 tatttttttcctctacactg tgaatcttta agccttggtg gactagagca acatcgtgct 1680 gcccaaaggactaacctatg caaactagtt cacattttag tggatgtcgc agttaatgtg 1740 taataagacattatttcccc tgcataatgt acaacagcat tgaaatgaca cattaagcct 1800 agcatcacattgtatagtac agtcactcac aaacccttca aggctaccct aatcattaac 1860 attaatatttgtttaaaagc aaatcaccga tttatctatt gaaactactt aaatgacggc 1920 aaaccaggaatgacagatgg ctgtgtcagc aatggcttta atgtgttccc tgcaagtggt 1980 ctcctatgatagaactgcgt tctcaaatgc actctcttca gggtcttaat attctgtgtt 2040 ttctctctgtatttgtaaaa cattataaca cattaatttc ctatctctac acatttggtt 2100 tgcttaaataaatgcaggat ataaa 2125 35 1686 DNA Homo sapiens misc_feature Incyte ID No3009303CB1 35 tctgactgcc agcaccttac agagaagaga ccatcaccac tgtggtgaagagcccacgtg 60 gccaacgacg gtcccccagc aagtccccct cccgctcacc ttcccgctgctctgccagcc 120 cgctgaggcc aggcctactg gcccccgacc tgctgtacct gccaggtgctggccagcccc 180 gcaggccgga ggcagaacca ggccagaagc ccgtggtgcc cacactgtatgtgacggagg 240 ccgaggccca ctctccagct ctgcccggac tctcggggcc ccagcccaagtgggtggagg 300 tggaggagac cattgaagtc cgggtgaaga agatgggccc gcaggtgtgtctcccaccac 360 agaggtgccc aggagctcat cggggcatct cttcacactg cccggtgcgacccccggagg 420 gaccccaatt ccaacaactc caacaacaag ctgctggccc aggaggcctgggcccagggc 480 acagccatgg tcggcgtcag agagcccctt gtcttccgcg tggatgccagaggcagtgtg 540 gactgggctg cttctggcat gggcagcctg gaggaggagg gcaccatggaggaggcggga 600 gaggaagagg gggaagacgg agacgccttt gtgacggagg agtcccaggacacacacagc 660 cttggggatc gtgaccccaa gatcctcacg cacaacggcc gcatgctgacactggctgac 720 ctggaagatt acgtgcctgg ggaaggggag accttccact gtggtggccctgggcctggc 780 gcccctgatg accctccctg cgaggtctcg gtgatccaga gagagatcggggagcccacg 840 gtggggcagc ctgtgctgct cagcgtgggg catgcactgg gtccccgaggccctctcggc 900 ctctttaggc ctgagccccg tggggcgtca ccaccgggac cccaggtccgtagccttgag 960 ggcacctcct tcctcttgcg ggaggccccg gctcggcctg tgggcagtgctccctggacg 1020 cagtctttct gcacccgcat ccggcgttct gcggacagtg gccagagcagcttcaccaca 1080 gagctttcca cccagaccgt caacttcggg acagtggggg agacggtcacccttcacatc 1140 tgcccagaca gggatgggga tgaggcggca cagccctgat gctgctgccatggtggcttg 1200 gggcagcggg gagaaaggag tgtccttgag gcctaggacg ctgcccggcctcagcagcag 1260 ccctgggagc ctcctgaggg ccctccctgt ccctggccac gggcccttcttacctcactc 1320 aacttcagcc aggaggactg ggtggtgctt gcaatgttgg aatgaccggctcaaagacct 1380 cagctctggg ctgtttcctg tcagcctggc aggagcctca ggactgtggacgaaggatgt 1440 ggccttgggc atttgtcctg ttcccacatg ggcctggtcc ctccctcctggccccagcca 1500 cagctgccag gcctgacatg gccttgcctc tcctgcagtc ttggtgactgagacccttgg 1560 gtggcgcttc ccagctctgc aggccctcct ggccttttct gcagggtggacacagggtct 1620 gtgtgtgggc agcagcccct gtctctcagc aagaataaag cagcttcctgtgcaaaaaaa 1680 aaaaaa 1686 36 2350 DNA Homo sapiens misc_feature IncyteID No 3434460CT1 36 cttgaaagga tcattgtgcg gattaaaaga aataatatatgtaaagcact ttaacacagc 60 accaggccca cggaaagtgg ctaatgttag ctactatgaatggtgccagt gaagacactg 120 aaaaataagt gatttcagta accttctgga aagctatcagtttcaaataa tattttctct 180 gtaatatgag atgaaattaa aagtggatag ctttcaggaaagataaagag aacatgctta 240 gaatgtaagc taaacagatt ttttctgttg ctctttgaaaactatgagcc ctggccagct 300 taacctggtc tgaggtgaga ctaaacacaa aaacagtagataaatctctc cctaaaagat 360 ggattccccc acatacccat gctactagtt tctctgtctattcacacata tgtacaaata 420 catgaacaca gcctgtctgt gctcagacat agagaagtactacctgactt gagtcaatgc 480 acccaagaag aaaagcttgg agtagagcag aagggagggcttgggactcc tgtctttcca 540 gcatgccctg gggtgcagtg gtcagccacc tgaagagagagccaatagca tggggtttac 600 aaggcaaaga tagtcattca ttcaacacat attcatagagctccttctct gtgccagaca 660 ctgttctgga agatagctag atgaaaatct ttgcactcacagagcttaca tgccagtgag 720 tgaagatcga tgataaataa agcaaatgca tcatatgttcacatttgata agtatatgcc 780 aaaaaatgaa gccgggaagg aggacaaggc ccatgggtgggtgttgaggt ttttaaagtg 840 tggtcaggaa aggccccact gataaggtaa catttgagcaagtctgaaaa aggcaagggg 900 atctttgggg ctaacttcgg gatccctgca ctttatgtaagaatgtaaac ctggagtctc 960 atttaagaat gatcagcaat acgtttagaa catatgaactgaatgaaatg gacatttttt 1020 cttaatttac gtataaatcc atatgattat acataaagttctgatgcatt aataaaagca 1080 gccaaatagg gccaaagaga aaaataacag gactctgtactggacctaac tttatcatta 1140 attaggtaat attttcctca tttctttact gctgccattttcctcaccag tattccagag 1200 atggtcatag ctcattactc taccaccaag aacctaaaaggaattagaat acagcagaat 1260 tggcctcagt gaagagctta aaattgttct cctcgtagaactggactatt gatcattacc 1320 acgtgacgtt ggctctatta ctttctgttc ccaatgtccttctagtggtt tgaaaatgtt 1380 aaaacatccc taaaatctaa atcatataat cagaattctatagtgtccca ctctatctgt 1440 aaagatcatt tggaagactt tagactctat taattttaaaaggaatattt attagccata 1500 tgcagaattt ctaatgatga tattgtacag cttctaattcacttttcaga tcagtgtttg 1560 aaatggcaat tatcagtgtt ggatttagtt ccaactacttgatttacaaa aatgtacatt 1620 tagaggttaa aagaaacagt gagaaatgta aacattcaaaatgataattg aatctctcag 1680 ttgtgggaat aattatcaga gacatgcaac tgaaaatgtctcacctttca tctttttttc 1740 ttaattcata aagttatctt gtagaatttg atgagaccctcctagtcatt ctcaactggg 1800 gcggtgctgt caccgaatgg tgtttgagag tgttggggctagggcacatt tttggttgtc 1860 acagcaactg gggtggcatt tgctgcccag tgccaggaatagtaacatta tgaatgccag 1920 ggacagtgtg ctcagtaaag tcttccatcc aaaaggggcagggcacggtg gctcacgcct 1980 gtaatcccag cactttggga ggccaaggtg ggcggatcacctgatgtcag gggttcgaga 2040 ccagcctggc caacatggtg aaaccctgtt gctactaaaaatacaaaaat tggctgggtg 2100 tggtgtcaca tgccagtaac cccagctact agggaggctgaggcaggaga atcacttgaa 2160 cccgggaggc agaggttgca gtgagctgag attgcaccactacactccag cctggatgac 2220 agagtgagac ttcatctcaa aaaaaaaaaa aaaagggcggcagctctaga ggaaccaagc 2280 taacgtacgc gagcatgcga catcatagat cttctatagtgtcacctaat taatacatgg 2340 ccgtacagag 2350 37 3502 DNA Homo sapiensmisc_feature Incyte ID No 5022769CT1 37 gcggccgctg acagcaccag catgtcttacagtgtgaccc tgactgggcc cgggccctgg 60 ggcttccgtc tgcagggggg caaggacttcaacatgcccc tcactatctc ccggatcaca 120 ccaggcagca aggcagccca gtcccagctcagccagggtg acctcgtggt ggccattgac 180 ggcgtcaaca cagacaccat gacccacctggaagcccaga acaagatcaa gtctgccagc 240 tacaacttga gcctcaccct gcagaaatcaaagcgtccca ttcccatctc cacgacagca 300 cctccagtcc agacccctct gccggtgatccctcaccaga aggaccccgc tctggacacg 360 aacggcagcc tggtggcacc cagccccagccctgaggcga gggccagccc aggcacccca 420 ggcaccccgg agctcaggcc cacctttagccctgccttct cccggccctc cgccttctcc 480 tcactcgccg aggcctctga ccctggccctccgcgggcca gcctgagggc caagaccagc 540 ccagaggggg cccgggacct actcggcccaaaagccctgc cgggctcgag ccagccgagg 600 caatataaca accccattgg cctgtactcggcagagaccc tgagggagat ggctcagatg 660 taccagatga gcctccgagg gaaggcctcgggtgtcggac tcccaggagg gagcctccct 720 attaaggacc ttgccgtaga cagcgcctctcccgtctacc aggctgtgat taagagccag 780 aacaagccag aagatgaggc tgacgagtgggcacgccgtt cctccaacct gcagtctcgc 840 tccttccgca tcctggccca gatgacggggacagaattca tgcaagaccc tgatgaagaa 900 gctctgcgaa ggtcaagcac ccctattgagcatgcgccgg tgtgcaccag ccaggccacc 960 accccgctgc tgcccgcttc tgcccagccacctgctgctg cctctcccag tgcggcttcg 1020 ccacccctgg ccacagctgc tgcccacactgccatcgcct ccgcctccac cacagcccct 1080 gcttcaagtc ctgccgacag cccaaggccccaggcctctt cctacagccc cgcagtggcc 1140 gcctcttcag cacctgccac ccacaccagctacagtgagg gccccgccgc ccctgcaccc 1200 aagccccggg ttgtcaccac tgccagcatccggccttctg tctaccagcc agtgcctgca 1260 tctacctaca gcccgtcccc aggggccaattacagtccca ctccctacac cccctcccct 1320 gcccctgcct acaccccctc ccctgcccctgcctacaccc cctcacctgt ccccacctac 1380 actccatccc cagcaccagc ctataccccctcacctgccc ccaactataa ccctgcaccc 1440 tcggtggcct acagcggggg ccctgcggagcctgccagcc gtccaccctg ggtgacagat 1500 gatagcttct cccagaagtt tgccccgggcaagagcacca cctccatcag caagcagacc 1560 ctgccccggg gaggcccagc ctacaccccagcgggtcctc aggtgccacc acttgccagg 1620 gggaccgtcc agagggctga gcgattcccagccagcagcc ggactccact ctgcggtcac 1680 tgcaacaatg tcatccgggg cccatttctggtagccatgg gccgttcttg gcaccctgaa 1740 gagttcacct gtgcctactg caagacttccctggcagatg tgtgctttgt ggaagagcag 1800 aacaacgttt actgtgagcg atgttatgagcaattctttg ccccgctgtg tgccaagtgc 1860 aacaccaaaa ttatggggga agtaatgcatgccttgagac agacatggca caccacctgc 1920 ttcgtctgtg cggcctgcaa gaagccttttgggaacagcc tcttccacat ggaagacggg 1980 gagccctact gcgagaaaga ctacatcaatctgttcagca ccaagtgcca tggctgcgat 2040 ttccccgtgg aggctggcga caagtttatcgaagccctgg gccacacttg gcacgacacc 2100 tgcttcattt gcgcagtctg ccatgtgaatctggaggggc agccgttcta ctccaagaag 2160 gacagacccc tgtgcaagaa gcacgcacacaccatcaact tgtaggcggc caaggccgcc 2220 tgtgctgacg aggcccggag ctgctcctgctgctggcaac aaaggattcg ggaggctgat 2280 gtttcttctg aggggaatgg ggagagagaggaagcgactg agccctttgg aagtataatt 2340 ttaggttttt tcttctgtac acagatcgtgcatttgcata gttcagacta ggagccaaat 2400 gaagactcaa aaccaagcta gttattaatccaagactgga attgtacttc agacatttag 2460 agcagaattc caagaactca aaagtgaaaagcaacaagca gctttcccaa agcgatacac 2520 ttgctttggt caccagagga ggacagagcttagagcagct gtggagaatc tgaagcattc 2580 tgcggagttc ttaagcgctc ccctggcaaacaaattgaag tgccaaacag cactcgctgc 2640 agggtatttt tagagtcata gctgagagcttgttagctaa gacccattgg gctttcctca 2700 ccaaaaaagg aagtgttatt ccattactagcgtcatggag ctacctctgc gcatcagact 2760 tcagaccttg aacaaactta aaaccttcttgggagcccgg acgtccaaag agatgtcttc 2820 tgggagccac tgggcaattg ccagggctccaggaagggct ctggctcagg ttgcagacag 2880 ctgagaaaag atggccctgt cagccaccctctctcagtct gaaacatcca acatccccag 2940 aaggcttagc tcctttttga attgtgatgggaaagtagag ttgggttttt ccagttttgc 3000 tctgtggtgt gtgagagatt tttttaaaggctttgggttg tctttggcct ttgtttagct 3060 ttaagggttc gttagcatga gtgtccagtcgtgtgcatga atttcacccc aacttgtgac 3120 tgctcactta tgacgtctcc cccagtaccctccatctcaa ataggcttgg tggcctgtgg 3180 aaaagaagag agacagagag acagtgtctgaaacaggatg gcagaatagg ctcacatgcc 3240 caaactctgg gtggggaaga ggaaacttactttctgccac cctcagtaag aacacacgag 3300 gaggcaggac ctcccacctt caggtctgcatcatcctttt caaatgttcc tttaaatgca 3360 gcacactgag tttgtacaat tgtgttaactgctggaaggg acagatgcac tgatatatat 3420 gcatttgctg ttttggccaa tattttgaaaatgtatgagc tgagttgatc tagctattat 3480 ttaagtattt attgaagtag ag 3502 381689 DNA Homo sapiens misc_feature Incyte ID No 944140CT1 38 cagacgtatgaagcatccat ggacaagctg agggaaaagc agaggcagtt ggaggtagcg 60 caagttgaaaaccagctgct aaaaatgaag gtggaatcgt cccaagaagc caatgctgag 120 gtgatgcgagagatgaccaa gaagctgtac agccagtatg aggagaagct gcaggaagaa 180 cagaggaagcacagtgctga gaaggaggct cttttggaag aaaccaatag ttttctgaaa 240 gcgattgaagaagccaataa aaagatgcaa gcagcagaga tcagcctaga ggagaaagac 300 cagaggatcggggagctgga caggctgatt gagcgcatgg aaaaggaacg tcatcaactg 360 caacttcaactcctagaaca tgaaacagaa atgtctgggg agttaactga ttctgacaag 420 gaaaggtatcagcagttgga ggaggcatca gccagcctcc gtgagcggat cagacaccta 480 gatgacatggtgcattgcca gcagaagaaa gtcaagcaga tggtcgagga gattgaatca 540 ttaaagaaaaagttgcaaca gaaacagctc ttaatactgc agcttttaga aaagatatct 600 ttcttagaaggagagaataa tgaactacaa agcaggttgg actatttaac agaaacccag 660 gccaagaccgaagtggaaac cagagagata ggagtgggct gtgatcttct acccagccaa 720 acaggcaggactcgtgaaat tgtgatgcct tctaggaact acaccccata cacaagagtc 780 ctggagttaaccatgaagaa aactctgact taggcactca gaggcataca ctttttacag 840 atggacaaaagctctggaac cctgtggctt caaatccttt gggaagggtg actgttgttt 900 cccctacacacagtgtaagc cggaatggga atcgctgagg ctctgatcca cttctaagac 960 aggaaggaaagtgaaggcag agtgagcagg taagagaggg atatacaagg tcacatttca 1020 gacacccactcggcataccc tgccgtactg catcatcatt tgttttcttt gtagacactg 1080 aaatcctatcaggaggattc cttcacaatg tattttattt gctagacttt ggttgggagg 1140 gaaaaggacattaatttgaa gtttcatgtt attcatgcca ggattgtttg atagagcatg 1200 aaggttttgtttacccataa aagtattaga ggcagcgttt ctctgataca gagaggcctg 1260 tccacaagaagcatgggcac ccagccaaac ttgaacctgg aagggagggt tcccggcctg 1320 caggtgctctttcctcttgg tcccaagcat ctgtgcaggg tcgtgggagc cacactgaga 1380 gacttgtgtgggccagacaa gcttcattct gatgcgctag tcccttggtt taatttgtgc 1440 cttatgctttcattggacca gctgaaatca ctgtatttat tcaacttgtg attttttttt 1500 ctttctcactttaacttaaa gagaatttta tatgtcttgg aaatttaata atttagtgtt 1560 ctcagtatcaattggtgttt ttgttaaacg aatgaatcat ctgttcatgc atgctctact 1620 ttgatattataacctatgtc acatgtgttt aataaatacc atatattttg ttctaaaaaa 1680 aaaaaaaaa1689 39 1918 DNA Homo sapiens misc_feature Incyte ID No 3445829CB12 39cagcctgcca cttgcctccc tgcctgcttc tggctgcctt gaatgcctgg tccttcaagc 60tccttctggg tctgacaaag cagggaccat gtctaccttt ggctaccgaa gaggactcag 120taaatacgaa tccatcgacg aggatgaact cctcgcctcc ctgtcagccg aggagctgaa 180ggagctagag agagagttgg aagacattga acctgaccgc aaccttcccg tggggctaag 240gcaaaagagc ctgacagaga aaacccccac agggacattc agcagagagg cactgatggc 300ctattgggaa aaggagtccc aaaaactctt ggagaaggag aggctggggg aatgtggaaa 360ggttgcagaa gacaaagagg aaagtgagga agagcttatc tttactgaaa gtaacagtga 420ggtttctgag gaagtgtata cagaggagga ggaggaggag tcccaggagg aagaggagga 480agaagacagt gacgaagagg aaagaacaat tgaaactgca aaagggatta atggaactgt 540aaattatgat agtgtcaatt ctgacaactc taagccaaag atatttaaaa gtcaaataga 600gaacataaat ttgaccaatg gcagcaatgg gaggaacaca gagtccccag ctgccattca 660cccttgtgga aatcctacag tgattgagga cgctttggac aagattaaaa gcaatgaccc 720tgacaccaca gaagtcaatt tgaacaacat tgagaacatc acaacacaga cccttacccg 780ctttgctgaa gccctcaagg acaacactgt ggtgaagacg ttcagtctgg ccaacacgca 840tgccgacgac agtgcagcca tggccattgc agagatgctc aaagtcaatg agcacatcac 900caacgtaaac gtcgagtcca acttcataac gggaaagggg atcctggcca tcatgagagc 960tctccagcac aacacggtgc tcacggagct gcgtttccat aaccagaggc acatcatggg 1020cagccaggtg gaaatggaga ttgtcaagct gctgaaggag aacacgacgc tgctgaggct 1080gggataccat tttgaactcc caggaccaag aatgagcatg acgagcattt tgacaagaaa 1140tatggataaa cagaggcaaa aacgtttgca ggagcaaaaa cagcaggagg gatacgatgg 1200aggacccaat cttaggacca aagtctggca aagaggaaca cctagctctt caccttatgt 1260atctcccagg cactcaccct ggtcatcccc aaaactcccc aaaaaagtcc agactgtgag 1320gagccgtcct ctgtctcctg tggccacacc tcctcctcct ccccctcctc ctcctcctcc 1380ccctccttct tcccaaaggc tgccaccacc tcctcctcct ccccctcctc cactcccaga 1440gaaaaagctc attaccagaa acattgcaga agtcatcaaa caacaggaga gtgcccaacg 1500ggcattacaa aatggacaaa aaaagaaaaa agggaaaaag gtcaagaaac agccaaacag 1560tattctaaag gaaataaaaa attctctgag gtcagtgcaa gagaagaaaa tggaagacag 1620ttcccgacct tctaccccac agagatcagc tcatgagaat ctcatggaag caattcgggg 1680aagcagcata aaacagctaa agcgggtaag taaccagaga acagacatag gggcacagat 1740aaagtaaatg agttgtcctc cattgcatgg tggtaccaaa gtcacctctc acaatactta 1800tcaatacttt caatatttta gtatgcgaga gcaaacacac caagtttgaa acattaggag 1860caggcacaca agtgagcaca tttctatttg agaggaacgc ctgggccgct ttcccagg 1918 401086 DNA Homo sapiens misc_feature Incyte ID No 3016490CT1 40 gcggccgctacggcgtcttc gtcaagatca agtatggctc cgagacgggc cagggcacca 60 ttagtgtgttcaagcacggg gacgagccca aggagctgaa gagcatgtga cagcgtgtgt 120 ccaggcacagtctgagtcta gtctgcatgg accagtaggg acaacctgta ccagggtcac 180 agcctggcacaggctacagg ggtggggcag aaggaaaggg gacaagatag aacccaggat 240 gtgagggtgggggtggagcg gatgcaccaa agtggagaag caaagatctt tctggggtcc 300 tgagtggcttccaggagagc gggatgaacc ctggacctgg agtaggagac ccggatgcac 360 tggggctatctaacagtact ggcatctgat aggtagaggt caggtacgct gctaaacact 420 gcagctcccaccacatagaa ttatccgacc ccagatgtca aaagtgccaa gggccatgag 480 ccctgccataaactgataca tcgcacccct cttttaggat cccatagttt caattcatgt 540 aagttcaacagacacctgaa gtctagcatg tgggaggctg aggatggagc tgggaacaca 600 aaggcagctgataagcaggt tctgcttgca aagaggcctc agtccagtgg gagaaacaga 660 cctgggcgcaaacaactcca ggacaaggca ggacatgata aagattataa agcaggtcca 720 aggaaagtgccgccagtggt ccaaggaggg agacagaggg tcgtcccaac agggggaggt 780 agggctttgaaaacaccttc atccaggctg ggcgaggtgg ctcacgcctg taatcccagt 840 agtttgggaggccaaggcgg gcagatcacc tgaggtcagg agttttagac cagcctggcc 900 aacatgacgaaactcagtct ctactaaaaa tacaaaaatt agccaggcat ggtgggcagt 960 agctgtaatcccggctattc agaaggccga ggtgggagaa tccgttgaaa cttgggaggc 1020 ggaggttgtgaattgagcca gatttgggcc aaaaaaaaaa ttggccgaaa ttggtgtttg 1080 ggcccc 108641 3441 DNA Homo sapiens misc_feature Incyte ID No 4151935CB1 41gtttcaaagg acacaaagag agatgtggac tcaaagtcac cggggatgcc tttatttgaa 60gcagaggaag gagttctatc acgaacccag atatttccta ccactattaa agtcattgat 120ccagaatttc tggaggagcc acctgcactt gcatttttat ataaggatct gtatgaagaa 180gcagttggag agaaaaagaa ggaagaggag acagcttctg aaggtgacag tgtgaattct 240gaggcatcat ttcccagcag aaattctgac actgatgatg gaacaggaat atattttgag 300aagtacatac tcaaagatga cattctccat gacacatctc taactcaaaa ggaccagggc 360caaggtctgg aagaaaaacg agttggtaag gatgattcat accaaccgat agctgcagaa 420ggggaaattt ggggaaagtt tggaactatt tgcagggaga agagtctgga agaacagaaa 480ggtgtttatg gggaaggaga atcagtagac catgtggaga ccgttggtaa cgtagcgatg 540cagaagaaag ctcccatcac agaggacgtc agagtggcta cccagaaaat aagttatgcg 600gttccatttg aagacaccca tcatgttctg gagcgtgcag atgaagcagg cagtcacggt 660aatgaagtcg gaaatgcaag tccagaggtc aatctgaatg tcccagtaca agtgtccttc 720ccggaggaag aatttgcatc tggtgcaact catgttcaag aaacatcact agaagaacct 780aaaatcctgg tcccacctga gccaagtgaa gagaggctcc gtaatagccc tgttcaggat 840gagtatgaat ttacagaatc cctgcataat gaagtggttc ctcaagacat attatcagaa 900gaactgtctt cagaatccac acctgaagat gtcttatctc aaggaaagga atcctttgag 960cacatcagtg aaaatgaatt tgcgagtgag gcagaacaaa gtacacctgc tgaacaaaaa 1020gagttgggca gcgagaggaa agaagaagac caattatcat ctgaggtagt aactgaaaag 1080gcacaaaaag agctgaaaaa gtcccagatt gacacatact gttacacctg caaatgtcca 1140atttctgcca ctgacaaggt gtttggcacc cacaaagacc atgaagtttc aacgcttgac 1200acagctataa gtgctgtaaa ggttcaatta gcagaatttc tagaaaattt acaagaaaag 1260tccttgagga ttgaagcctt tgttagtgag atagaatcct tttttaatac cattgaggaa 1320aactgtagta aaaatgagaa aaggctagaa gaacagaatg aggaaatgat gaagaaggtt 1380ttagcacagt atgatgagaa agcccagagc tttgaggaag tgaagaagaa gaagatggag 1440ttcctgcatg agcagatggt ccactttctg cagagcatgg acactgccaa agacaccctg 1500gagaccatcg tgagagaagc agaggagctt gatgaggccg tcttcctgac ttcgtttgag 1560gaaatcaatg aaaggttgct ttctgcaatg gagagcactg cttctttaga gaaaatgcct 1620gctgcgtttt ccctttttga acattatgat gacagctcgg caagaagtga ccagatgtta 1680aaacaagtgg ctgttccaca gcctcctaga ttagaacctc aggaaccaaa ttctgccacc 1740agcacaacaa ttgcagttta ctggagcatg aacaaggaag atgtcattga ttcatttcag 1800gtttactgca tggaggagcc acaagatgat caagaagtaa atgagttggt agaagaatac 1860agactgacag tgaaagaaag ctactgcatt tttgaagatc tggaacctga ccgatgctat 1920caagtgtggg tgatggctgt gaacttcact ggatgtagcc tgcccagtga aagggccatc 1980tttaggacag caccctccac ccctgtgatc cgcgctgagg actgtactgt gtgttggaac 2040acagccacta tccgatggcg gcccaccacc ccagaggcca cggagaccta cactctggag 2100tactgcagac agcactctcc tgagggagag ggcctcagat ctttctctgg aatcaaagga 2160ctccagctga aagttaacct ccaacccaat gataactact ttttctatgt gagggccatc 2220aatgcatttg ggacaagtga acagagtgaa gctgctctca tctccaccag aggaaccaga 2280tttctcttgt tgagagaaac agctcatcct gctctacaca tttcctcaag tgggacagtg 2340atcagctttg gtgagaggag acggctgacg gaaatcccgt cagtgctggg tgaggagctg 2400ccttcctgtg gccagcatta ctgggaaacc acagtcacag actgcccagc atatcgactc 2460ggcatctgct ccagctcggc tgtgcaggca ggtgccctag gacaagggga gacctcatgg 2520tacatgcact gctctgagcc acagagatac acatttttct acagtggtat tgtgagtgat 2580gttcatgtga ctgagcgtcc agccagagtg ggcatcctgc tggactacaa caaccagaga 2640cttatcttca tcaacgcaga gagcgagcag ttgctcttca tcatcaggca caggtttaat 2700gagggtgtcc accctgcctt tgccctggag aaacctggaa aatgtacttt gcacctgggg 2760atagagcccc cggattctgt aaggcacaag tgatccttgg ctttcagaat ttgcaagaac 2820agcgatttga attttggggg ggtctgctgt tcattccttt aggtgctata cattattcaa 2880aaagtctccc gcgcatttgc actaatgatg gctgcatgca tagcaatcag catgtgagca 2940aaatcgacaa gaaaaccttg actttacaga gcagtgtgtg agtaaacaga atgaaaacaa 3000caacctccac tctttagttt atataagttt gagttctttc ctaaattaaa agatctacac 3060ttgagttggg aaccaaaaga gaaaaatgga cttccatctg ttttactggt aaaggaaatc 3120ctctgatgga caggtcagag tgaaggaagg ttgtgctggt aagacatctc tgacgaagag 3180ccatggatgc tttccacaaa atgtcacctc gctgcactaa aggatgatga atcctaatca 3240ttaaaggaat tgtttcagct gatttaaatt tataatgaac tcttttgtaa taatgtatac 3300tgtagaacat gagtctctcc tccctaaaat tttaaatgta gaaaagtgct atatattaga 3360aatttccatt ttgttaaata aatggttaga gtctataaag ccagtcatgt tatgtgaact 3420tactccatgt aacttactgg c 3441 42 1461 DNA Homo sapiens misc_featureIncyte ID No 3719652CT1 42 cactaagaag gggctgtgct ttgatcccct gcctcttgcactaccaatgt ctcaagacat 60 aatattcatc tcttgctgtc agacccattc tatattctaaaagcttctgc tccttccttc 120 ccaatttctc ctttgtagca ggaaattaca cccagccctcatctcaatta atgctaaata 180 aagctattgt ttttccaaaa cacaaatcta cactgggtctcaatatcagt gatgaggctt 240 acaaaccaac acgttttctg ccatgaggat ttctctttaggccagaagta caaaacaaaa 300 aaaccaatgg attttaacca aaatgatttg aaatataggtgaggattcag gagaaggcaa 360 aagctagaaa cacttggggt tgtcaacatg agtattacattaacattgct tgatgagaac 420 ctctaatgat actgacaaca taaattacct agggtaaaggatagctgcaa caatgaaaca 480 ggaaagaaga gagggagaga gaggaaaggg aaggaagaaaggaaggaggg agaagggaag 540 aaagaaacaa tgtctaaccc aaccctatct tgaaagttgaactcaagtag aaaaatggat 600 agaaacaaaa ttctctagta ctcatccagg aaaccattcttcaatgttgc atgtggctgt 660 ttgccaaggc acacaaagtg cttgtaggca gcaaccatatgctacaagaa ttgtaaactg 720 catacagttt gtttgaagta gacagtgagg ataataacaaagttgctagg caggaaaaaa 780 aatcaggaaa aaagcttgtc gctatttgag aatctgtatatttttaaagg cttaaaatat 840 tataaccaca gggtatccag ccaaattcaa cattactgcaagtcttagag atttaaacat 900 tcatttgatt catagctaaa tattcaccat aatccaggagggtctccttc cccactgcag 960 aggcagaacg tccaagaatg gagtaagatt agtcatagtaaagtctcagt ctgaatattt 1020 agcaagagaa acaggcagca gaggaaccca aaggcagtaaatcaaatatt ctaaaaccca 1080 aagttcatta ttttcatcca aaagactttc acagaaacacattactcaca gccatgtata 1140 tcttggacag agtttcagat ggaatgactt gtctgaaatttgtaaagctt aatataggtt 1200 ttgggggaat tattttaata ttcaaagaat gttttattatagtcctttgt gttaaaattt 1260 agccttacta attataacaa taactcataa agttctaaattcagaaggaa tgtctgttct 1320 ttatcaagtg tatgtaacta ttttttagaa atgccatctactttctagaa acactaaagt 1380 tattgttttc taagttaaat aactataatt tatatatctattaaaaaggt acttctcttc 1440 ccaaaaaaaa aaaaaaaaaa a 1461 43 854 DNA Homosapiens misc_feature Incyte ID No 3046106CT1 43 ttttgaagta tttttaaaaggggtttggag gtagcatccg aaatcatata aagattgggg 60 ataaatgttg aatttttgagatatggaatg tctattaaga ggtggaataa agattgtatg 120 tgtcatactc tttggaggaaagtggtcccc caaaatgaca gcaattccta aggagtttgt 180 gaaggggtac atgttggaatcatatagagt aaatatcata aaaactatcc atacattact 240 gttgcattgg caagagcacatcatttagaa tatacatcca attattaaat ttatttaata 300 ggcaagatgt tatagagaagacagttctca agattctttt tcagtttcca ttgactaaat 360 ttctaacttt agaaagctctgaatgtgaca tatttcgcca ttcttcagca agagtgatgt 420 caaacttaca tccccactttgcaaaaatat atcacttcaa tggaggtggc atataaacct 480 gaatttttat tttatggaaggttgctatgt gaatatacag agctgaaggt ttaggagggc 540 aactaagggt cttatcgtaccacatctctg gcccttattg aatgtttctt ttcctaagtc 600 cattcctgac tccagtttgctgtataatcc tgagactcct ttacagaata cggggatcta 660 acatgtagag actattcctgtaattggtgt ttcttggagg cattgcaaaa ccaaattttt 720 ctttactttg tagcacttttgactaatgtt atctaaggac tgtatcaaag aattggtttc 780 tattagattt tagtttaagaaatcttacaa ttttgttaca gagcaggcta tttggaggat 840 gaaactgaaa ttaa 854 44714 DNA Homo sapiens misc_feature Incyte ID No 3012947CB1 44 accctttcagtaatcattca accaacgctt ccatgtctct actctgtcgt aacaaaggct 60 gtgggcagcactttgaccct aataccaacc ttcctggtca gagttgcctc tgaagctgct 120 gccgctaaatatatcccaag ccctggaaat ggcattggaa cagaaggaat tagaccagga 180 acctggggcaggacttgaca gtctgatccg gactggttcc agctgccaga acccaggatg 240 tgatgctgtttaccaaggcc ctgagagtga tgctactcca tgtacctacc acccaggagc 300 accccgattccatgagggga tgaagtcttg gagctgttgt ggcatccaga ccctggattt 360 tggggcattcttggcacaac cagggtgcag agtcggtaga catgactggg ggaagcagct 420 cccagcatcttgccgccatg attggcacca gacagattcc ttagtagtgg tgactgtata 480 tggccagattccacttcctg cgtttaactg ggtgaaggcc agtcaaactg agcttcatgt 540 ccacattgtctttgatggta accgtgtgtt ccaagcacag atgaagctct ggggggtaag 600 tgaagaccaggggacacaag agtgggaggc agatgggtga aagagcggct agactggaat 660 agagggtgtcttgagggaag gagttgtact aggaaaatgg aggttttctc ttca 714 45 1434 DNA Homosapiens misc_feature Incyte ID No 466761CT1 45 caagaatgta tcctttcagctctctttggt tatacctgaa gccaggagcg ttgagttatt 60 agccttgtgt ttatattcctctcactgtaa ttggtgtcat tttcccagca gtcctagcag 120 tcctcaagca agtgggaaatcggaaaagaa aaggacaggc attgtaggga agcagaggat 180 aaagaattta gccaacaaaagaaacaatct agtcaatctg ggtgctttta tttcctgggt 240 actctctaaa catggctcagagctggtgta gatgaagtag gtgaaacctc tgaaaagagt 300 ctagaaggca gtagagcaagtcccagacca gaaacatgct catcttttca tcgtaatgtg 360 ccactcggta ctatttggtaatgtcactct atttttccta atcccatcct ttggtttgta 420 tttcatattt gtatataaggcaccattttc taaaaatatg actagggtgt gacctaaggt 480 tttattctgt gaagatgagtaactggaaag aagctaacac tgcagtggga aggaaggaag 540 agagttgtcc aggtggtagttcgacgtgtt ttgaatctag tccttcctac atggaggata 600 aaagctccta aagtccactctgggtttgtg attttaatag aaatagaaag ggaaactata 660 gaccaatgga gatgaaaatcaggggctatc gacagatgga ggagaaataa ggtgctacat 720 agagaaagga agagggcagaaggctttccc ttcccaaact gggtgagctg gggaagcctt 780 ggttcaggag agtggcactgcccacaactg ctttgtgggt tgtgcacttc cagccgcact 840 ctccccctcc agttgctgccttcagagccg tactgaagca cgagcttcaa taagacaagc 900 acacttcata gtgagagggcagcggtacca aagcctttca gagagactat ggattagaca 960 gaaatgattt gtgagaggaagctggagtga acagcatgaa cagcgagtgt tacctgacag 1020 aggcaagaca gctagaagtggcttcagatt tagaaacagc tgaggggagc aaagacggac 1080 tgtgtacaca gggagggaggatgtctatgg gcagagccct tggtgagtat catcaccaag 1140 aaaggcagtc cagagtagagatcagccgaa tatggaggct gaggtctgta gaactgggcc 1200 agagaggacc ttactgccttagtagcataa gggtctggaa aagaagtttc tatctcacaa 1260 caaaggaaaa agtgaaaagcaaggtggaac ttgaagatac gtcacgaaaa tcactataaa 1320 agtctgattt atgtgtgatgtcaaatcaaa ctgaaatgaa gaatgagatt gagtatatct 1380 gtggtgactg acctctgtatactagaaacc tcaacatctc tagaagagga aata 1434 46 2298 DNA Homo sapiensmisc_feature Incyte ID No 1644171CT1 46 tgagaaccaa ctcattttgg tatttttagtagagacgaaa ccccatcctc ccaaagtgct 60 gggattacag gcatgagctg ccgcacccggcctccacctg ggttttgagc caatcccctg 120 gacttgctcc tggtttcctc aaggggtggggcagtggttt aggacactcg acaactaaga 180 acaggagttc ccaggaagga caaggatctgcatcccccac tgccacttct ctgatgtgtt 240 cctcaaagct ggctcgaggg ctcgatcccttcatcggact caggagggga ctggttggtg 300 tatccaggta atttactctt ggaagtgactgtagtgaagg tcgtggaagg gctcagaggg 360 ttaattggtt tgcagtgcgt ctttgtctattgcatgtctt ggaaaactca gatcccaaag 420 gcgctgggtt tcagagagga cagtggagaccttgctcctt ttccttaggc cgccagtctc 480 tcaaatttca gaggaggctg tttccacaactcccctatgg aaacacttgg cagcggagtt 540 gctcctttgc agtttccaca ccatggcttttcctttcctt tcttctccat tccctgatgc 600 atcaacactt acttggagca atttcctaggagtcagaacc agcaccagcc actcggtgtc 660 ggtggccacc aaggcttaac attgaccttcccgcctgacc ttgatgcaga tgtccactga 720 acacaccgca ggaaagccag ggccttcaataccaataagt gtgaatatgt gtgtatgttg 780 tccaagagag attagggaga tcacatagactctagggagt agagaacttg taacagtctt 840 gcaaggctag catgcacggc tccacagcaggtggtgggga gcagaggggc aggacctgca 900 gggaagaagc agcctttgga tggtgaaatgtgcatggtgc acagtctgtg catgcccagg 960 agacccagcc cgggctgcct cgaggggctcctttgtacac agccagccgc ttctcttggg 1020 aacaagctgt cctgggggcc ttacccacgaggcaggagtc aggatgcacc agctcagcac 1080 caggaagtca tcctggaccc aggacagtggaaaggcaggc agagggagag gcactctgag 1140 gtcaggcagg gtaagccagt tggcagtcaggttaggtcta tgaggagaac ctcgagttag 1200 gaattcccgg ttctcagaat tgttatcactctggtgcatg ctgtcacagg ggccgttgcg 1260 tttggctttg tggagggcct ggacccttccacaagaacac ccgaggttcc agggcactca 1320 ggacaatgtt tccaaggaac gagtcgaccaggaaagaaca gtgagttctg caaggggcat 1380 ccacggagcc tgtgataggg gctgatgagatggaatctgt cctggacttt tcttctcatt 1440 aaccaccctc cgcaaacccc agaacccctcgcctcatctc tgtactgtct gccctcttgg 1500 gggatgggcc ctcccacttt cccctgcctgctcctccatg ctgtgagctg ctttggcaga 1560 tctgtttttc tgtgtagtca ggggaaaaacaaaaaaagat gcacaactgt gtgggcattg 1620 tcatagctgt tggtgtcacc actgctttgggggaaatggc tgggatgagg ctaatacatt 1680 catgcaatat ttatattttc agggggctgcgttatcagca tgctctccct gccttgggct 1740 tttctttccg tcatgttttc cttttcgtgttccttctctg atttctcttg tctctgctgc 1800 tcacaggcct gcccatcagt cagtacagatactcagtgtc tggtttctgg ccagctccgt 1860 ggagggggct ttaagcagaa ttctgactctttggggtggg ggattaggaa ctgggggaaa 1920 cttaatgatc cagagattcc cccaagaggagtgtctggaa ggatctgtgc ctggacagtg 1980 gcagaacctt tccagtgttc ttttggttctgatttcatca gtctcaataa agttccgatc 2040 tctctttaaa aaaaaaaaaa aacaaaaaaaaaaaaaaaaa aaaaaaaaaa agacaaaaaa 2100 aaaaaagggg gccccccaaa aggggggggaccccgcccca agcgcgaaag cgcctcaana 2160 gctttcccnn gaaaaaattt ttccccccccaaaattccag cccgctggtg gagtcgcctg 2220 tcnnnnnnnn nnnnnnnnnn nnnnnctnnnnnnnnnnnnn nnnnnnnnnn nnggnnncnn 2280 nnnnnnnnnn nnnnnccc 2298 47 728DNA Homo sapiens misc_feature Incyte ID No 3009806CB1 47 gacaatagggagaatggaga acgtggaggt cttcaccgct gagggcaaag gaaggggtct 60 gaaggccaccaaggagttct gggctgcaga tatcatcttt gctgagcggg cttattccgc 120 agtggtttttgacagccttg ttaattttgt gtgccacacc tgcttcaaga ggcaggagaa 180 gctccatcgctgtgggcagt gcaagtttgc ccattactgc gaccgcacct gccagaagga 240 tgcttggctgaaccacaaga atgaatgttc ggccatcaag agatatggga aggtgcccaa 300 tgagaacatcaggctggcgg cgcgcatcat gtggagggtg gagagagaag gcaccgggct 360 cacggagggctgcctggtgt ccgtggacga cttgcagaac cacgtggagc actttgggga 420 ggaggagcagaaggacctgc gggtggacgt ggacacattc ttgcagtact ggccggcgca 480 gagccagcagttcagcatgc agtacatctc gcacatcttc ggagtgatta actgcaacgg 540 ttttactctcagtgatcaga gaggcctgca cagcgtgggg cgtaaggatc tttccccacc 600 tggggctggtgaaccatgac tgttggccca actgtaactg gcaaatttta caatgggcat 660 cctgagggcattgaaatccc aaggttcatt accaagattg ggaatttgag cctccgggcc 720 ccttaggg 72848 1158 DNA Homo sapiens misc_feature Incyte ID No 5578191CB1 48cagctcgagg gacggcacca tggaggactc cgaggcggtg cagagggcca cagcgctcat 60cgagcagcgg ctggcacagg aggaggagaa tgagaaactc cgaggagaca cacgccagaa 120gctgcccatg gacttgctgg tgctggagga tgagaagcac cacggggctc agagtgcagc 180cctgcagaag gtgaagggcc aagagcgcgt gcgcaagacg tccctggacc tgcggcggga 240gatcatcgat gtgggcggga tccagaacct catcgagctg cggaagaaac gcaagcagaa 300gaagcgggac gctctggccg cctcgcatga gccgccccca gagcccgagg agatcactgg 360ccctgtggat gaggagacct tcctgaaagc tgcggtggag gggaaaatga aggtcattga 420gaagttcctg gctgacgggg ggtcagccga cacgtgcgac cagttccgtc ggacagcact 480gcaccgagct tccctggaag gccacatgga aatcctggag aagcttctag ataatggggc 540cactgtggac ttccaggatc ggctggactg cacagccatg cattgggcct gccgcggggg 600ccacttagag gtggtgaaac ttctgcaaag ccatggagca gacaccaatg tgagggataa 660gctgctgagc accccgctgc acgtggcagt ccggacaggg caggtggaga ttgtggagca 720ctttctatcc ctgggcctgg aaatcaatgc cagagacagg gaaggggata ctgccctgca 780tgacgctgtg aggctcaacc gctacaaaat catcaaactg ctgctcctgc atggggctga 840catgatgacc aagaacctgg caggaaagac cccgacggac ctggtgcagc tctggcaggc 900tgatacccgg cacgccctgg agcatcctga gccgggggct gagcataacg ggctggaggg 960gcctaatgat agtgggcgag agacccctca gcctgtgcca gcccagtgaa tgcgtgcccc 1020agcccagcca gctacccagc ccctctctgt gtgcagccgg agggtcctaa gaatggctcc 1080cggagctaac tgagggccca gccttttttc tgcatgatcc aggagcacat accacaaact 1140accacaataa aaaagctg 1158 49 70 PRT Homo sapiens misc_feature Incyte IDNo 3601719CD1 49 Met Leu Glu Pro Ser Arg Gln Ile Ser Ile Phe Gln Trp GluPro 1 5 10 15 Phe Gly Gln Glu Gln Val Asn Pro Pro Glu Glu Lys Asn ValLeu 20 25 30 Leu Lys Trp Arg Arg Val Phe Leu Pro Pro Arg Met Arg Arg Arg35 40 45 Ser Gln Phe Gln Glu Arg Arg Asn Phe Gln Asp Leu Gln Ser Ile 5055 60 Tyr Arg Lys Ser Arg Ile Leu Lys Val Asn 65 70 50 552 PRT Homosapiens misc_feature Incyte ID No 3445829CD1 50 Met Ser Thr Phe Gly TyrArg Arg Gly Leu Ser Lys Tyr Glu Ser 1 5 10 15 Ile Asp Glu Asp Glu LeuLeu Ala Ser Leu Ser Ala Glu Glu Leu 20 25 30 Lys Glu Leu Glu Arg Glu LeuGlu Asp Ile Glu Pro Asp Arg Asn 35 40 45 Leu Pro Val Gly Leu Arg Gln LysSer Leu Thr Glu Lys Thr Pro 50 55 60 Thr Gly Thr Phe Ser Arg Glu Ala LeuMet Ala Tyr Trp Glu Lys 65 70 75 Glu Ser Gln Lys Leu Leu Glu Lys Glu ArgLeu Gly Glu Cys Gly 80 85 90 Lys Val Ala Glu Asp Lys Glu Glu Ser Glu GluGlu Leu Ile Phe 95 100 105 Thr Glu Ser Asn Ser Glu Val Ser Glu Glu ValTyr Thr Glu Glu 110 115 120 Glu Glu Glu Glu Ser Gln Glu Glu Glu Glu GluGlu Asp Ser Asp 125 130 135 Glu Glu Glu Arg Thr Ile Glu Thr Ala Lys GlyIle Asn Gly Thr 140 145 150 Val Asn Tyr Asp Ser Val Asn Ser Asp Asn SerLys Pro Lys Ile 155 160 165 Phe Lys Ser Gln Ile Glu Asn Ile Asn Leu ThrAsn Gly Ser Asn 170 175 180 Gly Arg Asn Thr Glu Ser Pro Ala Ala Ile HisPro Cys Gly Asn 185 190 195 Pro Thr Val Ile Glu Asp Ala Leu Asp Lys IleLys Ser Asn Asp 200 205 210 Pro Asp Thr Thr Glu Val Asn Leu Asn Asn IleGlu Asn Ile Thr 215 220 225 Thr Gln Thr Leu Thr Arg Phe Ala Glu Ala LeuLys Asp Asn Thr 230 235 240 Val Val Lys Thr Phe Ser Leu Ala Asn Thr HisAla Asp Asp Ser 245 250 255 Ala Ala Met Ala Ile Ala Glu Met Leu Lys ValAsn Glu His Ile 260 265 270 Thr Asn Val Asn Val Glu Ser Asn Phe Ile ThrGly Lys Gly Ile 275 280 285 Leu Ala Ile Met Arg Ala Leu Gln His Asn ThrVal Leu Thr Glu 290 295 300 Leu Arg Phe His Asn Gln Arg His Ile Met GlySer Gln Val Glu 305 310 315 Met Glu Ile Val Lys Leu Leu Lys Glu Asn ThrThr Leu Leu Arg 320 325 330 Leu Gly Tyr His Phe Glu Leu Pro Gly Pro ArgMet Ser Met Thr 335 340 345 Ser Ile Leu Thr Arg Asn Met Asp Lys Gln ArgGln Lys Arg Leu 350 355 360 Gln Glu Gln Lys Gln Gln Glu Gly Tyr Asp GlyGly Pro Asn Leu 365 370 375 Arg Thr Lys Val Trp Gln Arg Gly Thr Pro SerSer Ser Pro Tyr 380 385 390 Val Ser Pro Arg His Ser Pro Trp Ser Ser ProLys Leu Pro Lys 395 400 405 Lys Val Gln Thr Val Arg Ser Arg Pro Leu SerPro Val Ala Thr 410 415 420 Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro ProPro Pro Ser Ser 425 430 435 Gln Arg Leu Pro Pro Pro Pro Pro Pro Pro ProPro Pro Leu Pro 440 445 450 Glu Lys Lys Leu Ile Thr Arg Asn Ile Ala GluVal Ile Lys Gln 455 460 465 Gln Glu Ser Ala Gln Arg Ala Leu Gln Asn GlyGln Lys Lys Lys 470 475 480 Lys Gly Lys Lys Val Lys Lys Gln Pro Asn SerIle Leu Lys Glu 485 490 495 Ile Lys Asn Ser Leu Arg Ser Val Gln Glu LysLys Met Glu Asp 500 505 510 Ser Ser Arg Pro Ser Thr Pro Gln Arg Ser AlaHis Glu Asn Leu 515 520 525 Met Glu Ala Ile Arg Gly Ser Ser Ile Lys GlnLeu Lys Arg Val 530 535 540 Ser Asn Gln Arg Thr Asp Ile Gly Ala Gln IleLys 545 550 51 260 PRT Homo sapiens misc_feature Incyte ID No 2837330CD151 Met Ser Leu Leu Trp Thr Pro Lys Gly Lys Met Arg Leu Gln Ala 1 5 10 15Glu Lys Leu Asn Lys Ala Pro Gln Gly Gly Ile Gly Thr Ala Ala 20 25 30 ValArg Pro Lys Ser Leu Ala Ile Ser Ser Ser Leu Val Ser Asp 35 40 45 Val ValArg Pro Lys Thr Gln Gly Thr Asp Leu Lys Thr Ser Ser 50 55 60 His Pro GluMet Leu His Gly Met Ala Pro Gln Gln Lys His Gly 65 70 75 Gln Gln Tyr LysThr Lys Ser Ser Tyr Lys Ala Phe Ala Ala Phe 80 85 90 Pro Thr Asn Thr LeuLeu Leu Glu Gln Lys Thr Pro Thr Thr Leu 95 100 105 Pro Arg Ala Ala GlyArg Glu Thr Lys Tyr Ala Asn Leu Ser Ser 110 115 120 Pro Thr Ser Thr ValSer Glu Ser Gln Leu Thr Lys Pro Gly Val 125 130 135 Ile Arg Pro Val ProVal Lys Ser Arg Ile Leu Leu Lys Lys Glu 140 145 150 Glu Glu Val Tyr GluPro Asn Pro Phe Ser Lys Tyr Leu Glu Asp 155 160 165 Asn Ser Asp Leu PheSer Glu Gln Asp Val Thr Val Pro Pro Lys 170 175 180 Pro Val Ser Leu HisPro Leu Tyr Gln Thr Lys Leu Tyr Pro Pro 185 190 195 Ala Lys Ser Leu LeuHis Pro Gln Thr Leu Ser His Ala Asp Cys 200 205 210 Leu Ala Pro Gly ProPhe Ser His Leu Ser Phe Ser Leu Ser Asp 215 220 225 Glu Gln Glu Asn SerHis Thr Leu Leu Ser His Asn Ala Cys Asn 230 235 240 Lys Leu Ser His ProMet Val Ala Ile Pro Glu His Glu Ala Leu 245 250 255 Asp Ser Lys Glu Gln260 52 364 PRT Homo sapiens misc_feature Incyte ID No 1737459CD1 52 MetSer Ala Asn Ser Ser Arg Val Gly Gln Leu Leu Leu Gln Gly 1 5 10 15 SerAla Cys Ile Arg Trp Lys Gln Asp Val Glu Gly Ala Ile Tyr 20 25 30 His LeuAla Asn Cys Leu Leu Leu Leu Gly Phe Met Gly Gly Ser 35 40 45 Gly Val TyrGly Cys Phe Tyr Leu Phe Gly Phe Leu Ser Ala Gly 50 55 60 Tyr Leu Cys CysVal Leu Trp Gly Trp Phe Ser Ala Cys Gly Leu 65 70 75 Asp Ile Val Leu TrpSer Phe Leu Leu Ala Val Val Cys Leu Leu 80 85 90 Gln Leu Ala His Leu ValTyr Arg Leu Arg Glu Asp Thr Leu Pro 95 100 105 Glu Glu Phe Asp Leu LeuTyr Lys Thr Leu Cys Leu Pro Leu Gln 110 115 120 Val Pro Leu Gln Thr TyrLys Glu Ile Val His Cys Cys Glu Glu 125 130 135 Gln Val Leu Thr Leu AlaThr Glu Gln Thr Tyr Ala Val Glu Gly 140 145 150 Glu Thr Pro Ile Asn ArgLeu Ser Leu Leu Leu Ser Gly Arg Val 155 160 165 Arg Val Ser Gln Asp GlyGln Phe Leu His Tyr Ile Phe Pro Tyr 170 175 180 Gln Phe Met Asp Ser ProGlu Trp Glu Ser Leu Gln Pro Ser Glu 185 190 195 Glu Gly Val Phe Gln ValThr Leu Thr Ala Glu Thr Ser Cys Ser 200 205 210 Tyr Ile Ser Trp Pro ArgLys Ser Leu His Leu Leu Leu Thr Lys 215 220 225 Glu Arg Tyr Ile Ser CysLeu Phe Ser Ala Leu Leu Gly Tyr Asp 230 235 240 Ile Ser Glu Lys Leu TyrThr Leu Asn Asp Lys Leu Phe Ala Lys 245 250 255 Phe Gly Leu Arg Phe AspIle Arg Leu Pro Ser Leu Tyr His Val 260 265 270 Leu Gly Pro Thr Ala AlaAsp Ala Gly Pro Glu Ser Glu Lys Gly 275 280 285 Asp Glu Glu Val Cys GluPro Ala Val Ser Pro Pro Gln Ala Thr 290 295 300 Pro Thr Ser Leu Gln GlnThr Pro Pro Cys Ser Thr Pro Pro Ala 305 310 315 Thr Thr Asn Phe Pro AlaPro Pro Thr Arg Ala Arg Leu Ser Arg 320 325 330 Pro Asp Ser Gly Ile LeuAla Ser Arg Ile Pro Leu Gln Ser Tyr 335 340 345 Ser Gln Val Ile Ser ArgGly Gln Ala Pro Leu Ala Pro Thr His 350 355 360 Thr Pro Glu Leu 53 527PRT Homo sapiens misc_feature Incyte ID No 058201CD1 53 Met Glu Cys LeuVal Ala Asp Lys Gln Asn Phe His Lys Ser Cys 1 5 10 15 Phe Arg Cys HisHis Cys Asn Ser Lys Leu Ser Leu Gly Asn Tyr 20 25 30 Ala Ser Leu His GlyGln Ile Tyr Cys Lys Pro His Phe Lys Gln 35 40 45 Leu Phe Lys Ser Lys GlyAsn Tyr Asp Glu Gly Phe Gly His Lys 50 55 60 Gln His Lys Asp Arg Trp AsnCys Lys Asn Gln Ser Arg Ser Val 65 70 75 Asp Phe Ile Pro Asn Glu Glu ProAsn Met Cys Lys Asn Ile Ala 80 85 90 Glu Asn Thr Leu Val Pro Gly Asp ArgAsn Glu His Leu Asp Ala 95 100 105 Gly Asn Ser Glu Gly Gln Arg Asn AspLeu Arg Lys Leu Gly Glu 110 115 120 Arg Gly Lys Leu Lys Val Ile Trp ProPro Ser Lys Glu Ile Pro 125 130 135 Lys Lys Thr Leu Pro Phe Glu Glu GluLeu Lys Met Ser Lys Pro 140 145 150 Lys Trp Pro Pro Glu Met Thr Thr LeuLeu Ser Pro Glu Phe Lys 155 160 165 Ser Glu Ser Leu Leu Glu Asp Val ArgThr Pro Glu Asn Lys Gly 170 175 180 Gln Arg Gln Asp His Phe Pro Phe LeuGln Pro Tyr Leu Gln Ser 185 190 195 Thr His Val Cys Gln Lys Glu Asp ValIle Gly Ile Lys Glu Met 200 205 210 Lys Met Pro Glu Gly Arg Lys Asp GluLys Lys Glu Gly Arg Lys 215 220 225 Asn Val Gln Asp Arg Pro Ser Glu AlaGlu Asp Thr Lys Ser Asn 230 235 240 Arg Lys Ser Ala Met Asp Leu Asn AspAsn Asn Asn Val Ile Val 245 250 255 Gln Ser Ala Glu Lys Glu Lys Asn GluLys Thr Asn Gln Thr Asn 260 265 270 Gly Ala Glu Val Leu Gln Val Thr AsnThr Asp Asp Glu Met Met 275 280 285 Pro Glu Asn His Lys Glu Asn Leu AsnLys Asn Asn Asn Asn Asn 290 295 300 Tyr Val Ala Val Ser Tyr Leu Asn AsnCys Arg Gln Lys Thr Ser 305 310 315 Ile Leu Glu Phe Leu Asp Leu Leu ProLeu Ser Ser Glu Ala Asn 320 325 330 Asp Thr Ala Asn Glu Tyr Glu Ile GluLys Leu Glu Asn Thr Ser 335 340 345 Arg Ile Ser Glu Leu Leu Gly Ile PheGlu Ser Glu Lys Thr Tyr 350 355 360 Ser Arg Asn Val Leu Ala Met Ala LeuLys Lys Gln Thr Asp Arg 365 370 375 Ala Ala Ala Gly Ser Pro Val Gln ProAla Pro Lys Pro Ser Leu 380 385 390 Ser Arg Gly Leu Met Val Lys Gly GlySer Ser Ile Ile Ser Pro 395 400 405 Asp Thr Asn Leu Leu Asn Ile Lys GlySer His Ser Lys Ser Lys 410 415 420 Asn Leu His Phe Phe Phe Ser Asn ThrVal Lys Ile Thr Ala Phe 425 430 435 Ser Lys Lys Asn Glu Asn Ile Phe AsnCys Asp Leu Ile Asp Ser 440 445 450 Val Asp Gln Ile Lys Asn Met Pro CysLeu Asp Leu Arg Glu Phe 455 460 465 Gly Lys Asp Val Lys Pro Trp His ValGlu Thr Thr Glu Ala Ala 470 475 480 Arg Asn Asn Glu Asn Thr Gly Phe AspAla Leu Ser His Glu Cys 485 490 495 Thr Ala Lys Pro Leu Phe Pro Arg ValGlu Val Gln Ser Glu Gln 500 505 510 Leu Thr Val Glu Glu Gln Ile Lys ArgAsn Arg Cys Tyr Ser Asp 515 520 525 Thr Glu 54 82 PRT Homo sapiensmisc_feature Incyte ID No 5449893CD1 54 Met Ser Gln Ala Gly Ala Gln GluAla Pro Ile Lys Lys Lys Arg 1 5 10 15 Pro Pro Val Lys Glu Glu Asp LeuLys Gly Ala Arg Gly Asn Leu 20 25 30 Thr Lys Asn Gln Glu Ile Lys Ser LysThr Tyr Gln Val Met Arg 35 40 45 Glu Cys Glu Gln Ala Gly Ser Ala Ala ProSer Val Phe Ser Arg 50 55 60 Thr Arg Thr Gly Thr Glu Thr Val Phe Glu LysPro Lys Ala Gly 65 70 75 Pro Thr Lys Ser Val Phe Gly 80 55 302 PRT Homosapiens misc_feature Incyte ID No 282977CD1 55 Met Asn Val Gln Pro CysSer Arg Cys Gly Tyr Gly Val Tyr Pro 1 5 10 15 Ala Glu Lys Ile Ser CysIle Asp Gln Ile Trp His Lys Ala Cys 20 25 30 Phe His Cys Glu Val Cys LysMet Met Leu Ser Val Asn Asn Phe 35 40 45 Val Ser His Gln Lys Lys Pro TyrCys His Ala His Asn Pro Lys 50 55 60 Asn Asn Thr Phe Thr Ser Val Tyr HisThr Pro Leu Asn Leu Asn 65 70 75 Val Arg Thr Phe Pro Glu Ala Ile Ser GlyIle His Asp Gln Glu 80 85 90 Asp Gly Glu Gln Cys Lys Ser Val Phe His TrpAsp Met Lys Ser 95 100 105 Lys Asp Lys Glu Gly Ala Pro Asn Arg Gln ProLeu Ala Asn Glu 110 115 120 Arg Ala Tyr Trp Thr Gly Tyr Gly Glu Gly AsnAla Trp Cys Pro 125 130 135 Gly Ala Leu Pro Asp Pro Glu Ile Val Arg MetVal Glu Ala Arg 140 145 150 Lys Ser Leu Gly Glu Glu Tyr Thr Glu Asp TyrGlu Gln Pro Arg 155 160 165 Gly Lys Gly Ser Phe Pro Ala Met Ile Thr ProAla Tyr Gln Arg 170 175 180 Ala Lys Lys Ala Asn Gln Leu Ala Ser Gln ValGlu Tyr Lys Arg 185 190 195 Gly His Asp Glu Arg Ile Ser Arg Phe Ser ThrVal Ala Asp Thr 200 205 210 Pro Glu Leu Leu Arg Ser Lys Ala Gly Ala GlnLeu Gln Ser Asp 215 220 225 Val Arg Tyr Thr Glu Asp Tyr Glu Gln Gln ArgGly Lys Gly Ser 230 235 240 Phe Pro Ala Met Ile Thr Pro Ala Tyr Gln IleAla Lys Arg Ala 245 250 255 Asn Glu Leu Ala Ser Asp Val Arg Tyr His GlnGln Tyr Gln Lys 260 265 270 Glu Met Arg Gly Met Ala Gly Pro Ala Ile GlyAla Glu Gly Ile 275 280 285 Leu Thr Arg Glu Cys Ala Asp Gln Tyr Gly HisGly Tyr Pro Glu 290 295 300 Glu Tyr 56 193 PRT Homo sapiens misc_featureIncyte ID No 3178454CD1 56 Met Asn Thr Ser Phe Ser Asp Ile Glu Leu LeuGlu Asp Ser Gly 1 5 10 15 Ile Pro Thr Glu Ala Phe Leu Ala Ser Cys CysAla Val Val Pro 20 25 30 Val Leu Asp Lys Leu Gly Pro Thr Val Phe Ala ProVal Lys Met 35 40 45 Asp Leu Val Glu Asn Ile Lys Lys Val Asn Gln Lys TyrIle Thr 50 55 60 Asn Lys Glu Glu Phe Thr Thr Leu Gln Lys Ile Val Leu HisGlu 65 70 75 Val Glu Ala Asp Val Ala Gln Val Arg Asn Ser Ala Thr Glu Ala80 85 90 Leu Leu Trp Leu Lys Arg Gly Leu Lys Phe Leu Lys Gly Phe Leu 95100 105 Thr Glu Val Lys Asn Gly Glu Lys Asp Ile Gln Thr Ala Leu Asn 110115 120 Asn Ala Tyr Gly Lys Thr Leu Arg Gln His His Gly Trp Val Val 125130 135 Arg Gly Val Phe Ala Leu Ala Leu Arg Ala Thr Pro Ser Tyr Glu 140145 150 Asp Phe Val Ala Ala Leu Thr Val Lys Glu Gly Asp His Arg Lys 155160 165 Glu Ala Phe Ser Ile Gly Met Gln Arg Asp Leu Ser Leu Tyr Leu 170175 180 Pro Ala Met Lys Lys Gln Met Ala Ile Leu Asp Ala Leu 185 190 57174 PRT Homo sapiens misc_feature Incyte ID No 4152861CD1 57 Met Ser AsnGly Tyr Arg Thr Leu Ser Gln His Leu Asn Asp Leu 1 5 10 15 Lys Lys GluAsn Phe Ser Leu Lys Leu Arg Ile Tyr Phe Leu Glu 20 25 30 Glu Arg Met GlnGln Lys Tyr Glu Ala Ser Arg Glu Asp Ile Tyr 35 40 45 Lys Arg Asn Thr GluLeu Lys Val Glu Val Glu Ser Leu Lys Arg 50 55 60 Glu Leu Gln Asp Lys LysGln His Leu Asp Lys Thr Trp Ala Asp 65 70 75 Val Glu Asn Leu Asn Ser GlnAsn Glu Ala Glu Leu Arg Arg Gln 80 85 90 Phe Glu Glu Arg Gln Gln Glu ThrGlu His Val Tyr Glu Leu Leu 95 100 105 Glu Asn Lys Met Gln Leu Leu GlnGlu Glu Ser Arg Leu Ala Lys 110 115 120 Asn Glu Ala Ala Arg Met Ala AlaLeu Val Glu Ala Glu Lys Glu 125 130 135 Cys Asn Leu Glu Leu Ser Glu LysLeu Lys Gly Val Thr Lys Asn 140 145 150 Trp Glu Asp Val Pro Gly Asp GlnVal Lys Pro Asp Gln Tyr Thr 155 160 165 Glu Ala Leu Ala Gln Arg Asp LysIle 170 58 230 PRT Homo sapiens misc_feature Incyte ID No 3009303CD1 58Met Val Gly Val Arg Glu Pro Leu Val Phe Arg Val Asp Ala Arg 1 5 10 15Gly Ser Val Asp Trp Ala Ala Ser Gly Met Gly Ser Leu Glu Glu 20 25 30 GluGly Thr Met Glu Glu Ala Gly Glu Glu Glu Gly Glu Asp Gly 35 40 45 Asp AlaPhe Val Thr Glu Glu Ser Gln Asp Thr His Ser Leu Gly 50 55 60 Asp Arg AspPro Lys Ile Leu Thr His Asn Gly Arg Met Leu Thr 65 70 75 Leu Ala Asp LeuGlu Asp Tyr Val Pro Gly Glu Gly Glu Thr Phe 80 85 90 His Cys Gly Gly ProGly Pro Gly Ala Pro Asp Asp Pro Pro Cys 95 100 105 Glu Val Ser Val IleGln Arg Glu Ile Gly Glu Pro Thr Val Gly 110 115 120 Gln Pro Val Leu LeuSer Val Gly His Ala Leu Gly Pro Arg Gly 125 130 135 Pro Leu Gly Leu PheArg Pro Glu Pro Arg Gly Ala Ser Pro Pro 140 145 150 Gly Pro Gln Val ArgSer Leu Glu Gly Thr Ser Phe Leu Leu Arg 155 160 165 Glu Ala Pro Ala ArgPro Val Gly Ser Ala Pro Trp Thr Gln Ser 170 175 180 Phe Cys Thr Arg IleArg Arg Ser Ala Asp Ser Gly Gln Ser Ser 185 190 195 Phe Thr Thr Glu LeuSer Thr Gln Thr Val Asn Phe Gly Thr Val 200 205 210 Gly Glu Thr Val ThrLeu His Ile Cys Pro Asp Arg Asp Gly Asp 215 220 225 Glu Ala Ala Gln Pro230 59 915 PRT Homo sapiens misc_feature Incyte ID No 4151935CD1 59 MetPro Leu Phe Glu Ala Glu Glu Gly Val Leu Ser Arg Thr Gln 1 5 10 15 IlePhe Pro Thr Thr Ile Lys Val Ile Asp Pro Glu Phe Leu Glu 20 25 30 Glu ProPro Ala Leu Ala Phe Leu Tyr Lys Asp Leu Tyr Glu Glu 35 40 45 Ala Val GlyGlu Lys Lys Lys Glu Glu Glu Thr Ala Ser Glu Gly 50 55 60 Asp Ser Val AsnSer Glu Ala Ser Phe Pro Ser Arg Asn Ser Asp 65 70 75 Thr Asp Asp Gly ThrGly Ile Tyr Phe Glu Lys Tyr Ile Leu Lys 80 85 90 Asp Asp Ile Leu His AspThr Ser Leu Thr Gln Lys Asp Gln Gly 95 100 105 Gln Gly Leu Glu Glu LysArg Val Gly Lys Asp Asp Ser Tyr Gln 110 115 120 Pro Ile Ala Ala Glu GlyGlu Ile Trp Gly Lys Phe Gly Thr Ile 125 130 135 Cys Arg Glu Lys Ser LeuGlu Glu Gln Lys Gly Val Tyr Gly Glu 140 145 150 Gly Glu Ser Val Asp HisVal Glu Thr Val Gly Asn Val Ala Met 155 160 165 Gln Lys Lys Ala Pro IleThr Glu Asp Val Arg Val Ala Thr Gln 170 175 180 Lys Ile Ser Tyr Ala ValPro Phe Glu Asp Thr His His Val Leu 185 190 195 Glu Arg Ala Asp Glu AlaGly Ser His Gly Asn Glu Val Gly Asn 200 205 210 Ala Ser Pro Glu Val AsnLeu Asn Val Pro Val Gln Val Ser Phe 215 220 225 Pro Glu Glu Glu Phe AlaSer Gly Ala Thr His Val Gln Glu Thr 230 235 240 Ser Leu Glu Glu Pro LysIle Leu Val Pro Pro Glu Pro Ser Glu 245 250 255 Glu Arg Leu Arg Asn SerPro Val Gln Asp Glu Tyr Glu Phe Thr 260 265 270 Glu Ser Leu His Asn GluVal Val Pro Gln Asp Ile Leu Ser Glu 275 280 285 Glu Leu Ser Ser Glu SerThr Pro Glu Asp Val Leu Ser Gln Gly 290 295 300 Lys Glu Ser Phe Glu HisIle Ser Glu Asn Glu Phe Ala Ser Glu 305 310 315 Ala Glu Gln Ser Thr ProAla Glu Gln Lys Glu Leu Gly Ser Glu 320 325 330 Arg Lys Glu Glu Asp GlnLeu Ser Ser Glu Val Val Thr Glu Lys 335 340 345 Ala Gln Lys Glu Leu LysLys Ser Gln Ile Asp Thr Tyr Cys Tyr 350 355 360 Thr Cys Lys Cys Pro IleSer Ala Thr Asp Lys Val Phe Gly Thr 365 370 375 His Lys Asp His Glu ValSer Thr Leu Asp Thr Ala Ile Ser Ala 380 385 390 Val Lys Val Gln Leu AlaGlu Phe Leu Glu Asn Leu Gln Glu Lys 395 400 405 Ser Leu Arg Ile Glu AlaPhe Val Ser Glu Ile Glu Ser Phe Phe 410 415 420 Asn Thr Ile Glu Glu AsnCys Ser Lys Asn Glu Lys Arg Leu Glu 425 430 435 Glu Gln Asn Glu Glu MetMet Lys Lys Val Leu Ala Gln Tyr Asp 440 445 450 Glu Lys Ala Gln Ser PheGlu Glu Val Lys Lys Lys Lys Met Glu 455 460 465 Phe Leu His Glu Gln MetVal His Phe Leu Gln Ser Met Asp Thr 470 475 480 Ala Lys Asp Thr Leu GluThr Ile Val Arg Glu Ala Glu Glu Leu 485 490 495 Asp Glu Ala Val Phe LeuThr Ser Phe Glu Glu Ile Asn Glu Arg 500 505 510 Leu Leu Ser Ala Met GluSer Thr Ala Ser Leu Glu Lys Met Pro 515 520 525 Ala Ala Phe Ser Leu PheGlu His Tyr Asp Asp Ser Ser Ala Arg 530 535 540 Ser Asp Gln Met Leu LysGln Val Ala Val Pro Gln Pro Pro Arg 545 550 555 Leu Glu Pro Gln Glu ProAsn Ser Ala Thr Ser Thr Thr Ile Ala 560 565 570 Val Tyr Trp Ser Met AsnLys Glu Asp Val Ile Asp Ser Phe Gln 575 580 585 Val Tyr Cys Met Glu GluPro Gln Asp Asp Gln Glu Val Asn Glu 590 595 600 Leu Val Glu Glu Tyr ArgLeu Thr Val Lys Glu Ser Tyr Cys Ile 605 610 615 Phe Glu Asp Leu Glu ProAsp Arg Cys Tyr Gln Val Trp Val Met 620 625 630 Ala Val Asn Phe Thr GlyCys Ser Leu Pro Ser Glu Arg Ala Ile 635 640 645 Phe Arg Thr Ala Pro SerThr Pro Val Ile Arg Ala Glu Asp Cys 650 655 660 Thr Val Cys Trp Asn ThrAla Thr Ile Arg Trp Arg Pro Thr Thr 665 670 675 Pro Glu Ala Thr Glu ThrTyr Thr Leu Glu Tyr Cys Arg Gln His 680 685 690 Ser Pro Glu Gly Glu GlyLeu Arg Ser Phe Ser Gly Ile Lys Gly 695 700 705 Leu Gln Leu Lys Val AsnLeu Gln Pro Asn Asp Asn Tyr Phe Phe 710 715 720 Tyr Val Arg Ala Ile AsnAla Phe Gly Thr Ser Glu Gln Ser Glu 725 730 735 Ala Ala Leu Ile Ser ThrArg Gly Thr Arg Phe Leu Leu Leu Arg 740 745 750 Glu Thr Ala His Pro AlaLeu His Ile Ser Ser Ser Gly Thr Val 755 760 765 Ile Ser Phe Gly Glu ArgArg Arg Leu Thr Glu Ile Pro Ser Val 770 775 780 Leu Gly Glu Glu Leu ProSer Cys Gly Gln His Tyr Trp Glu Thr 785 790 795 Thr Val Thr Asp Cys ProAla Tyr Arg Leu Gly Ile Cys Ser Ser 800 805 810 Ser Ala Val Gln Ala GlyAla Leu Gly Gln Gly Glu Thr Ser Trp 815 820 825 Tyr Met His Cys Ser GluPro Gln Arg Tyr Thr Phe Phe Tyr Ser 830 835 840 Gly Ile Val Ser Asp ValHis Val Thr Glu Arg Pro Ala Arg Val 845 850 855 Gly Ile Leu Leu Asp TyrAsn Asn Gln Arg Leu Ile Phe Ile Asn 860 865 870 Ala Glu Ser Glu Gln LeuLeu Phe Ile Ile Arg His Arg Phe Asn 875 880 885 Glu Gly Val His Pro AlaPhe Ala Leu Glu Lys Pro Gly Lys Cys 890 895 900 Thr Leu His Leu Gly IleGlu Pro Pro Asp Ser Val Arg His Lys 905 910 915 60 163 PRT Homo sapiensmisc_feature Incyte ID No 3012947CD1 60 Met Ala Leu Glu Gln Lys Glu LeuAsp Gln Glu Pro Gly Ala Gly 1 5 10 15 Leu Asp Ser Leu Ile Arg Thr GlySer Ser Cys Gln Asn Pro Gly 20 25 30 Cys Asp Ala Val Tyr Gln Gly Pro GluSer Asp Ala Thr Pro Cys 35 40 45 Thr Tyr His Pro Gly Ala Pro Arg Phe HisGlu Gly Met Lys Ser 50 55 60 Trp Ser Cys Cys Gly Ile Gln Thr Leu Asp PheGly Ala Phe Leu 65 70 75 Ala Gln Pro Gly Cys Arg Val Gly Arg His Asp TrpGly Lys Gln 80 85 90 Leu Pro Ala Ser Cys Arg His Asp Trp His Gln Thr AspSer Leu 95 100 105 Val Val Val Thr Val Tyr Gly Gln Ile Pro Leu Pro AlaPhe Asn 110 115 120 Trp Val Lys Ala Ser Gln Thr Glu Leu His Val His IleVal Phe 125 130 135 Asp Gly Asn Arg Val Phe Gln Ala Gln Met Lys Leu TrpGly Val 140 145 150 Ser Glu Asp Gln Gly Thr Gln Glu Trp Glu Ala Asp Gly155 160 61 201 PRT Homo sapiens misc_feature Incyte ID No 3009806CD1 61Met Glu Asn Val Glu Val Phe Thr Ala Glu Gly Lys Gly Arg Gly 1 5 10 15Leu Lys Ala Thr Lys Glu Phe Trp Ala Ala Asp Ile Ile Phe Ala 20 25 30 GluArg Ala Tyr Ser Ala Val Val Phe Asp Ser Leu Val Asn Phe 35 40 45 Val CysHis Thr Cys Phe Lys Arg Gln Glu Lys Leu His Arg Cys 50 55 60 Gly Gln CysLys Phe Ala His Tyr Cys Asp Arg Thr Cys Gln Lys 65 70 75 Asp Ala Trp LeuAsn His Lys Asn Glu Cys Ser Ala Ile Lys Arg 80 85 90 Tyr Gly Lys Val ProAsn Glu Asn Ile Arg Leu Ala Ala Arg Ile 95 100 105 Met Trp Arg Val GluArg Glu Gly Thr Gly Leu Thr Glu Gly Cys 110 115 120 Leu Val Ser Val AspAsp Leu Gln Asn His Val Glu His Phe Gly 125 130 135 Glu Glu Glu Gln LysAsp Leu Arg Val Asp Val Asp Thr Phe Leu 140 145 150 Gln Tyr Trp Pro AlaGln Ser Gln Gln Phe Ser Met Gln Tyr Ile 155 160 165 Ser His Ile Phe GlyVal Ile Asn Cys Asn Gly Phe Thr Leu Ser 170 175 180 Asp Gln Arg Gly LeuHis Ser Val Gly Arg Lys Asp Leu Ser Pro 185 190 195 Pro Gly Ala Gly GluPro 200 62 329 PRT Homo sapiens misc_feature Incyte ID No 5578191CD1 62Met Glu Asp Ser Glu Ala Val Gln Arg Ala Thr Ala Leu Ile Glu 1 5 10 15Gln Arg Leu Ala Gln Glu Glu Glu Asn Glu Lys Leu Arg Gly Asp 20 25 30 ThrArg Gln Lys Leu Pro Met Asp Leu Leu Val Leu Glu Asp Glu 35 40 45 Lys HisHis Gly Ala Gln Ser Ala Ala Leu Gln Lys Val Lys Gly 50 55 60 Gln Glu ArgVal Arg Lys Thr Ser Leu Asp Leu Arg Arg Glu Ile 65 70 75 Ile Asp Val GlyGly Ile Gln Asn Leu Ile Glu Leu Arg Lys Lys 80 85 90 Arg Lys Gln Lys LysArg Asp Ala Leu Ala Ala Ser His Glu Pro 95 100 105 Pro Pro Glu Pro GluGlu Ile Thr Gly Pro Val Asp Glu Glu Thr 110 115 120 Phe Leu Lys Ala AlaVal Glu Gly Lys Met Lys Val Ile Glu Lys 125 130 135 Phe Leu Ala Asp GlyGly Ser Ala Asp Thr Cys Asp Gln Phe Arg 140 145 150 Arg Thr Ala Leu HisArg Ala Ser Leu Glu Gly His Met Glu Ile 155 160 165 Leu Glu Lys Leu LeuAsp Asn Gly Ala Thr Val Asp Phe Gln Asp 170 175 180 Arg Leu Asp Cys ThrAla Met His Trp Ala Cys Arg Gly Gly His 185 190 195 Leu Glu Val Val LysLeu Leu Gln Ser His Gly Ala Asp Thr Asn 200 205 210 Val Arg Asp Lys LeuLeu Ser Thr Pro Leu His Val Ala Val Arg 215 220 225 Thr Gly Gln Val GluIle Val Glu His Phe Leu Ser Leu Gly Leu 230 235 240 Glu Ile Asn Ala ArgAsp Arg Glu Gly Asp Thr Ala Leu His Asp 245 250 255 Ala Val Arg Leu AsnArg Tyr Lys Ile Ile Lys Leu Leu Leu Leu 260 265 270 His Gly Ala Asp MetMet Thr Lys Asn Leu Ala Gly Lys Thr Pro 275 280 285 Thr Asp Leu Val GlnLeu Trp Gln Ala Asp Thr Arg His Ala Leu 290 295 300 Glu His Pro Glu ProGly Ala Glu His Asn Gly Leu Glu Gly Pro 305 310 315 Asn Asp Ser Gly ArgGlu Thr Pro Gln Pro Val Pro Ala Gln 320 325

What is claimed is:
 1. A composition comprising a plurality ofpolynucleotides wherein the polynucleotides have the nucleic acidsequences of SEQ ID NOs: 1-48 or the complements of SEQ ID NOs: 1-48. 2.An isolated polynucleotide comprising a nucleic acid sequence selectedfrom SEQ ID NOs: 1-48 and the complements thereof.
 3. A compositioncomprising a polynucleotide of claim 2 and a labeling moiety.
 4. Amethod of using a polynucleotide to screen a plurality of molecules toidentify at least one ligand which specifically binds thepolynucleotide, the method comprising: a) combining the composition ofclaim 1 with a plurality of molecules under conditions to allow specificbinding; and b) detecting specific binding, thereby identifying a ligandwhich specifically binds a polynucleotide.
 5. The method of claim 4wherein the composition is attached to a substrate.
 6. The method ofclaim 4 wherein the molecules to be screened are selected from DNAmolecules, RNA molecules, peptide nucleic acids, mimetics, and proteins.7. A method of using a polynucleotide to purify a ligand, the methodcomprising: a) combining the polynucleotide of claim 2 with a sampleunder conditions to allow specific binding; b) recovering the boundpolynucleotide; and c) separating the ligand from the boundpolynucleotide, thereby obtaining purified ligand.
 8. The method ofclaim 7 wherein the polynucleotide is attached to a substrate.
 9. Amethod for using a polynucleotide to detect gene expression in a sample,the method comprising: a) hybridizing the composition of claim 1 to asample thereby forming at least one hybridization complex; b) detectingcomplex formation, wherein complex formation indicates gene expressionin the sample.
 10. The method of claim 9 wherein the polynucleotides ofthe composition are attached to a substrate.
 11. The method of claim 9wherein the sample is selected from blood or cells or tissues of theheart or vasculature.
 12. The method of claim 9 wherein gene expressionis compared to standards and indicates the presence of atherosclerosis,arteriosclerosis, atrial fibrillation, cancer (myxoma), complications ofcancer, cardiac injury, congestive heart failure, coronary arterydisease, hypertension, hypertrophic cardiomyopathy, myocardialhypertrophy, myocardial infarction, or plaque.
 13. A vector comprising apolynucleotide of claim
 2. 14. A host cell comprising the vector ofclaim
 13. 15. A method for using a host cell to produce a protein, themethod comprising: a) culturing the host cell of claim 14 underconditions for expression of the protein; and b) recovering the proteinfrom cell culture.
 16. A purified protein or a portion thereofcomprising an amino acid sequence selected from SEQ ID NO: 49-62.
 17. Acomposition comprising the protein of claim 16 and a pharmaceuticalcarrier or a labeling moiety.
 18. A method for using a protein to screena plurality of molecules to identify at least one ligand whichspecifically binds the protein, the method comprising: a) combining theprotein of claim 16 with the plurality of molecules under conditions toallow specific binding; and b) detecting specific binding between theprotein and ligand, thereby identifying a ligand which specificallybinds the polypeptide.
 19. The method of claim 18 wherein the pluralityof molecules is selected from DNA molecules, RNA molecules, peptidenucleic acids, mimetics, proteins, agonists, antagonists, andantibodies.
 20. A method of using a protein to prepare and purifyantibodies comprising: a) immunizing a animal with the protein of claim16 under conditions to elicit an antibody response; b) isolating animalantibodies; c) attaching the protein to a substrate; d) contacting thesubstrate with isolated antibodies under conditions to allow specificbinding to the protein; e) dissociating the antibodies from the protein,thereby obtaining purified antibodies.